public interface I_CmsExtractionResult
This data structure contains the extracted text as well as (optional) meta information extracted from the document.
| Modifier and Type | Field and Description |
|---|---|
| static java.lang.String | ITEM_AUTHOR: Key to access the document author name in the item map. |
| static java.lang.String | ITEM_CATEGORY: Key to access the document category in the item map. |
| static java.lang.String | ITEM_COMMENTS: Key to access the document comments in the item map. |
| static java.lang.String | ITEM_COMPANY: Key to access the document company name in the item map. |
| static java.lang.String | ITEM_CONTENT: Key for accessing the default (combined) content in getContentItems(). |
| static java.lang.String | ITEM_CREATOR: Key to access the document creator name in the item map. |
| static java.lang.String | ITEM_KEYWORDS: Key to access the document keywords in the item map. |
| static java.lang.String | ITEM_MANAGER: Key to access the document manager name in the item map. |
| static java.lang.String | ITEM_PRODUCER: Key to access the document producer name in the item map. |
| static java.lang.String | ITEM_RAW: Key for accessing the raw content in getContentItems(). |
| static java.lang.String | ITEM_SUBJECT: Key to access the document subject in the item map. |
| static java.lang.String | ITEM_TITLE: Key to access the document title in the item map. |
| Modifier and Type | Method and Description |
|---|---|
| byte[] | getBytes(): Returns this extraction result serialized as a byte array. |
| java.lang.String | getContent(): Returns the extracted content combined as a String. |
| java.util.Map<java.lang.String,java.lang.String> | getContentItems(): Returns the extracted content as individual items. |
| void | release(): Releases the information stored in this extraction result, to free up the memory used. |
static final java.lang.String ITEM_AUTHOR
static final java.lang.String ITEM_CATEGORY
static final java.lang.String ITEM_COMMENTS
static final java.lang.String ITEM_COMPANY
static final java.lang.String ITEM_CONTENT
Key for accessing the default (combined) content in getContentItems().
static final java.lang.String ITEM_CREATOR
static final java.lang.String ITEM_KEYWORDS
static final java.lang.String ITEM_MANAGER
static final java.lang.String ITEM_PRODUCER
static final java.lang.String ITEM_RAW
Key for accessing the raw content in getContentItems().
static final java.lang.String ITEM_SUBJECT
static final java.lang.String ITEM_TITLE
byte[] getBytes()
java.lang.String getContent()
java.util.Map<java.lang.String,java.lang.String> getContentItems()
The result Map contains all content items extracted by the extractor. Each key is a String holding the name of the item, and each value is a String holding the extracted text.
The detailed form depends on the resource type indexed:
- For an xmlpage, the key will be the element name, and the value will be the text of the element.
- For xmlcontent, the key will be the xpath of the XML node, and the value will be the text of that XML node.
- In addition, the map contains the key ITEM_CONTENT, which will contain the value of the complete content.
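
The map layout described above can be illustrated with a small, self-contained sketch. It does not use the OpenCms classes: the constant value for ITEM_CONTENT is a placeholder (the real string value is not given in this reference), and the xpath-style keys `Title[1]` and `Text[1]` are hypothetical examples of what an xmlcontent resource might produce.

```java
import java.util.LinkedHashMap;
import java.util.Map;

class ContentItemsSketch {

    // Placeholder only: the real constant is defined on I_CmsExtractionResult
    // and its actual string value is not shown in this reference.
    static final String ITEM_CONTENT = "content-key-placeholder";

    /** Returns the combined content, or an empty string if the item is absent. */
    static String combinedContent(Map<String, String> items) {
        return items.getOrDefault(ITEM_CONTENT, "");
    }

    /** Formats every item as "key = value", one per line, in map order. */
    static String describe(Map<String, String> items) {
        StringBuilder sb = new StringBuilder();
        for (Map.Entry<String, String> e : items.entrySet()) {
            sb.append(e.getKey()).append(" = ").append(e.getValue()).append('\n');
        }
        return sb.toString();
    }

    /** Builds a map shaped like the result for a hypothetical xmlcontent resource. */
    static Map<String, String> sampleItems() {
        Map<String, String> items = new LinkedHashMap<>();
        items.put("Title[1]", "Hello");            // hypothetical xpath key
        items.put("Text[1]", "World");             // hypothetical xpath key
        items.put(ITEM_CONTENT, "Hello World");    // combined content item
        return items;
    }

    public static void main(String[] args) {
        Map<String, String> items = sampleItems();
        System.out.print(describe(items));
        System.out.println(combinedContent(items));
    }
}
```

With a real extraction result, the same lookups would presumably be made against the map returned by getContentItems(), using the constants from I_CmsExtractionResult instead of the placeholder.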
void release()
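
Since release() exists to free the memory held by a result, a caller would plausibly read what it needs first and release afterwards. The following is a minimal sketch of that pattern under stated assumptions: `ExtractionResult` and `InMemoryResult` are hypothetical stand-ins that mirror three of the methods listed above, and the choice to null out the stored values on release is an assumption, not documented behavior of the real implementation.

```java
import java.util.HashMap;
import java.util.Map;

class ReleaseSketch {

    /** Hypothetical stand-in mirroring part of I_CmsExtractionResult. */
    interface ExtractionResult {
        String getContent();
        Map<String, String> getContentItems();
        void release();
    }

    /** Simple in-memory implementation used only for this illustration. */
    static final class InMemoryResult implements ExtractionResult {
        private String content;
        private Map<String, String> items;

        InMemoryResult(String content) {
            this.content = content;
            this.items = new HashMap<>();
            this.items.put("content-key-placeholder", content);
        }

        public String getContent() {
            return content;
        }

        public Map<String, String> getContentItems() {
            return items;
        }

        // Assumption: releasing drops the references so they can be collected.
        public void release() {
            content = null;
            items = null;
        }
    }

    /** Reads the content length, then releases the result even if reading fails. */
    static int contentLength(ExtractionResult result) {
        try {
            String text = result.getContent();
            return text == null ? 0 : text.length();
        } finally {
            result.release();
        }
    }

    public static void main(String[] args) {
        ExtractionResult result = new InMemoryResult("extracted text");
        System.out.println(contentLength(result));
        System.out.println(result.getContent() == null);
    }
}
```

The try/finally keeps the release call on every path, so a failure while processing the content does not leave the extracted text in memory.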