Extract images from office document

At times you might need to extract images from a powerpoint presentation or a word document. It can be very tedious job to extract those images individually. Moreover, this is a time consuming activity.
This can be easily done by renamming the powerpoint/word file into a zip file and the extracting data from it.
Here is how a typical powerpoint file looks when unzipped.

Folder structure
Directory structure of powerpoint file

Above tree structure reveals how infromation is organized in microsoft office powerpoint file.

  • docProps folder is used to store file properties and thumbnail information
  • ppt folder contains information on various ojects used in a presentation
  • _rels folder defines relationship among various files

To get all images in a microsoft office document, one can look into media folder, which contains all images used in an office document.
Similarly, other folders contain information about other objects used in an office document.

You can read more about this tree structure at MSDN.

Leave a Reply

Your email address will not be published. Required fields are marked *