Package | Description |
---|---|
cc.redpen.config |
Configuration and SymbolTable are provided.
|
cc.redpen.model |
Elements of Documents such as List, Sentence are provided.
|
cc.redpen.parser |
Parser and the implementations are provided.
|
cc.redpen.tokenizer |
Provides tokenizers for each languages.
|
Modifier and Type | Method and Description |
---|---|
RedPenTokenizer |
Configuration.getTokenizer()
returns Tokenizer aasociated with this configuration
|
Constructor and Description |
---|
DocumentBuilder(RedPenTokenizer tokenizer)
Constructor.
|
Modifier and Type | Method and Description |
---|---|
Document |
DocumentParser.parse(File file,
SentenceExtractor sentenceExtractor,
RedPenTokenizer tokenizer)
Given input file name, return Document instance for the specified file.
|
Document |
BaseDocumentParser.parse(File file,
SentenceExtractor sentenceExtractor,
RedPenTokenizer tokenizer) |
Document |
PlainTextParser.parse(InputStream is,
Optional<String> fileName,
SentenceExtractor sentenceExtractor,
RedPenTokenizer tokenizer) |
Document |
DocumentParser.parse(InputStream io,
Optional<String> fileName,
SentenceExtractor sentenceExtractor,
RedPenTokenizer tokenizer)
Given input stream, return Document instance from a stream.
|
Document |
AsciiDocParser.parse(InputStream io,
Optional<String> fileName,
SentenceExtractor sentenceExtractor,
RedPenTokenizer tokenizer) |
default Document |
DocumentParser.parse(InputStream is,
SentenceExtractor sentenceExtractor,
RedPenTokenizer tokenizer)
Given input stream, return Document instance from a stream.
|
Document |
DocumentParser.parse(String content,
SentenceExtractor sentenceExtractor,
RedPenTokenizer tokenizer)
Given content, return Document instance for the specified file.
|
Document |
BaseDocumentParser.parse(String content,
SentenceExtractor sentenceExtractor,
RedPenTokenizer tokenizer) |
Modifier and Type | Class and Description |
---|---|
class |
JapaneseTokenizer |
class |
WhiteSpaceTokenizer |
Copyright © 2015. All rights reserved.