Class OLE2ScratchpadExtractorFactory

java.lang.Object
org.apache.poi.extractor.ole2.OLE2ScratchpadExtractorFactory

public class OLE2ScratchpadExtractorFactory extends Object
Scratchpad-specific logic for OLE2ExtractorFactory and
invalid reference
org.apache.poi.extractor.ExtractorFactory
, which permit the other two to run with no Scratchpad jar (though without functionality!)

Note - should not be used standalone, always use via the other two classes

  • Constructor Details

    • OLE2ScratchpadExtractorFactory

      public OLE2ScratchpadExtractorFactory()
  • Method Details

    • createExtractor

      public static POITextExtractor createExtractor(DirectoryNode poifsDir) throws IOException
      Look for certain entries in the stream, to figure it out what format is desired Note - doesn't check for core-supported formats! Note - doesn't check for OOXML-supported formats
      Parameters:
      poifsDir - the directory node to be inspected
      Returns:
      the format specific text extractor
      Throws:
      IOException - when the format specific extraction fails because of invalid entires
    • identifyEmbeddedResources

      public static void identifyEmbeddedResources(POIOLE2TextExtractor ext, List<Entry> dirs, List<InputStream> nonPOIFS) throws IOException
      Returns an array of text extractors, one for each of the embedded documents in the file (if there are any). If there are no embedded documents, you'll get back an empty array. Otherwise, you'll get one open POITextExtractor for each embedded file.
      Parameters:
      ext - the extractor holding the directory to start parsing
      dirs - a list to be filled with directory references holding embedded
      nonPOIFS - a list to be filled with streams which aren't based on POIFS entries
      Throws:
      IOException - when the format specific extraction fails because of invalid entires