public final class CmsExtractorHtml extends A_CmsTextExtractor
| Modifier and Type | Method and Description |
|---|---|
| I_CmsExtractionResult | extractText(java.io.InputStream in, java.lang.String encoding): Extracts the text and meta information from the document on the input stream, using the specified content encoding. |
| static I_CmsTextExtractor | getExtractor(): Returns an instance of this text extractor. |
Methods inherited from class A_CmsTextExtractor: combineContentItem, extractText, extractText, extractText, extractText, removeControlChars

public static I_CmsTextExtractor getExtractor()
public I_CmsExtractionResult extractText(java.io.InputStream in, java.lang.String encoding) throws java.lang.Exception
Description copied from interface: I_CmsTextExtractor
The encoding is a hint for the text extractor. If the given value is null,
the text extractor should try to determine the encoding itself.
Specified by: extractText in interface I_CmsTextExtractor
Overrides: extractText in class A_CmsTextExtractor
Parameters:
in - the input stream for the document to extract the text from
encoding - the encoding to use
Throws:
java.lang.Exception - if the text extraction fails
See Also: I_CmsTextExtractor.extractText(java.io.InputStream, java.lang.String)
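
The "encoding is a hint" contract described above can be illustrated with a small stand-alone sketch. This is not the OpenCms implementation: the class `EncodingHintSketch` and its methods `resolveCharset` and `decode` are hypothetical names, and the fallback strategy (byte order mark sniffing, then a UTF-8 default) is only an assumption about what "figure out the encoding itself" might look like. It uses only the JDK and needs Java 9 or later for `InputStream.readAllBytes()`.

```java
import java.io.IOException;
import java.io.InputStream;
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

// Hypothetical illustration of treating the encoding argument as a hint.
// None of these names exist in OpenCms.
final class EncodingHintSketch {

    private EncodingHintSketch() {
        // static helpers only
    }

    // Uses the hint when one is given; otherwise guesses from a byte order mark.
    // Assumption: UTF-8 is the default when no hint and no byte order mark exist.
    static Charset resolveCharset(byte[] data, String encodingHint) {
        if (encodingHint != null) {
            return Charset.forName(encodingHint);
        }
        if (data.length >= 3
                && (data[0] & 0xFF) == 0xEF
                && (data[1] & 0xFF) == 0xBB
                && (data[2] & 0xFF) == 0xBF) {
            return StandardCharsets.UTF_8;
        }
        if (data.length >= 2 && (data[0] & 0xFF) == 0xFE && (data[1] & 0xFF) == 0xFF) {
            return StandardCharsets.UTF_16BE;
        }
        if (data.length >= 2 && (data[0] & 0xFF) == 0xFF && (data[1] & 0xFF) == 0xFE) {
            return StandardCharsets.UTF_16LE;
        }
        return StandardCharsets.UTF_8;
    }

    // Reads the whole stream and decodes it with the resolved charset.
    static String decode(InputStream in, String encodingHint) throws IOException {
        byte[] data = in.readAllBytes(); // Java 9+
        String text = new String(data, resolveCharset(data, encodingHint));
        // A decoded byte order mark shows up as U+FEFF at the start; drop it.
        if (!text.isEmpty() && text.charAt(0) == '\uFEFF') {
            text = text.substring(1);
        }
        return text;
    }
}
```

Calling `EncodingHintSketch.decode(stream, "ISO-8859-1")` decodes with the supplied hint, while `EncodingHintSketch.decode(stream, null)` falls back to the guessing path. A real HTML extractor would probably also inspect `<meta>` charset declarations, which this sketch leaves out.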