Modifier and Type | Class and Description |
---|---|
class |
POIOLE2TextExtractor
Common Parent for OLE2 based Text Extractors
of POI Documents, such as .doc, .xls
You will typically find the implementation of
a given format's text extractor under
org.zkoss.poi.[format].extractor .
|
class |
POIXMLPropertiesTextExtractor
A
POITextExtractor for returning the textual
content of the OOXML file properties, eg author
and title. |
class |
POIXMLTextExtractor |
Modifier and Type | Method and Description |
---|---|
abstract POITextExtractor |
POITextExtractor.getMetadataTextExtractor()
Returns another text extractor, which is able to
output the textual content of the document
metadata / properties, such as author and title.
|
POITextExtractor |
POIOLE2TextExtractor.getMetadataTextExtractor()
Returns an HPSF powered text extractor for the
document properties metadata, such as title and author.
|
Constructor and Description |
---|
POITextExtractor(POITextExtractor otherExtractor)
Creates a new text extractor, using the same
document as another text extractor.
|
Modifier and Type | Method and Description |
---|---|
static POITextExtractor |
ExtractorFactory.createExtractor(DirectoryNode poifsDir) |
static POITextExtractor |
ExtractorFactory.createExtractor(DirectoryNode poifsDir,
POIFSFileSystem fs)
Deprecated.
Use
ExtractorFactory.createExtractor(DirectoryNode) instead |
static POITextExtractor |
ExtractorFactory.createExtractor(File f) |
static POITextExtractor |
ExtractorFactory.createExtractor(InputStream inp) |
static POITextExtractor[] |
ExtractorFactory.getEmbededDocsTextExtractors(POIOLE2TextExtractor ext)
Returns an array of text extractors, one for each of
the embeded documents in the file (if there are any).
|
static POITextExtractor[] |
ExtractorFactory.getEmbededDocsTextExtractors(POIXMLTextExtractor ext)
Returns an array of text extractors, one for each of
the embeded documents in the file (if there are any).
|
Modifier and Type | Class and Description |
---|---|
class |
VisioTextExtractor
Class to find all the text in a Visio file, and return it.
|
Modifier and Type | Class and Description |
---|---|
class |
PublisherTextExtractor
Extract text from HPBF Publisher files
|
Modifier and Type | Class and Description |
---|---|
class |
HPSFPropertiesExtractor
Extracts all of the HPSF properties, both
build in and custom, returning them in
textual form.
|
Modifier and Type | Method and Description |
---|---|
POITextExtractor |
HPSFPropertiesExtractor.getMetadataTextExtractor()
Prevent recursion!
|
Constructor and Description |
---|
HPSFPropertiesExtractor(POITextExtractor mainExtractor) |
Modifier and Type | Class and Description |
---|---|
class |
PowerPointExtractor
This class can be used to extract text from a PowerPoint file.
|
Modifier and Type | Class and Description |
---|---|
class |
OutlookTextExtactor
A text extractor for HSMF (Outlook) .msg files.
|
Modifier and Type | Class and Description |
---|---|
class |
EventBasedExcelExtractor
A text extractor for Excel files, that is based
on the hssf eventusermodel api.
|
class |
ExcelExtractor
A text extractor for Excel files.
|
Modifier and Type | Class and Description |
---|---|
class |
Word6Extractor
Class to extract the text from old (Word 6 / Word 95) Word Documents.
|
class |
WordExtractor
Class to extract the text from a Word Document.
|
Modifier and Type | Class and Description |
---|---|
class |
XSLFPowerPointExtractor |
Modifier and Type | Class and Description |
---|---|
class |
XSSFEventBasedExcelExtractor
Implementation of a text extractor from OOXML Excel
files that uses SAX event based parsing.
|
class |
XSSFExcelExtractor
Helper class to extract text from an OOXML Excel file
|
Modifier and Type | Class and Description |
---|---|
class |
XWPFWordExtractor
Helper class to extract text from an OOXML Word file
|
Copyright © 2020. All rights reserved.