public class FileBackedBilingualCorpus extends java.lang.Object implements BilingualCorpus
BilingualCorpus
interface, formed
by two aligned FileBackedMonolingualCorpus
instances.Modifier and Type | Method and Description |
---|---|
static FileBackedBilingualCorpus |
getInstance(FileBackedMonolingualCorpus src,
FileBackedMonolingualCorpus trg,
CorpusAlignment alignments,
LexicalWeighting srcToTrgWeights,
LexicalWeighting trgToSrcWeights,
Dictionary dictionary)
Creates a bilingual corpus by aligning the given
FileBackedMonolingualCorpus pair. |
static FileBackedBilingualCorpus |
getInstance(java.io.File srcDir,
java.lang.String srcId,
java.io.File trgDir,
java.lang.String trgId)
Creates a previously aligned bilingual corpus backed in the given
directories.
|
static FileBackedBilingualCorpus |
getInstance(java.io.File dir,
java.lang.String srcId,
java.lang.String trgId)
Creates a previously aligned bilingual corpus backed in the given
directory.
|
MonolingualCorpus |
getSrc()
Returns the source
MonolingualCorpus of this bilingual corpus. |
MonolingualCorpus |
getTrg()
Returns the target
MonolingualCorpus of this bilingual corpus. |
java.lang.String |
translate(java.util.List<PhraseElement> fragment,
Dictionary dictionary,
Generator generator,
boolean tokenize)
Returns the translation of the given text fragment in this bilingual
corpus.
|
public static FileBackedBilingualCorpus getInstance(java.io.File dir, java.lang.String srcId, java.lang.String trgId) throws java.io.IOException
dir
- the directory in which the source and target FileBackedMonolingualCorpus
es are backed.srcId
- the unique identifier of the source FileBackedMonolingualCorpus
in the given directory.trgId
- the unique identifier of the target FileBackedMonolingualCorpus
monolingual corpus in the given directory.FileBackedBilingualCorpus
that corresponds to the specified arguments.java.io.IOException
- if some sort of I/O error occurs.public static FileBackedBilingualCorpus getInstance(java.io.File srcDir, java.lang.String srcId, java.io.File trgDir, java.lang.String trgId) throws java.io.IOException
srcDir
- the directory in which the source FileBackedMonolingualCorpus
is backed.srcId
- the unique identifier of the source FileBackedMonolingualCorpus
.trgDir
- the directory in which the target FileBackedMonolingualCorpus
is backed.trgId
- the unique identifier of the target FileBackedMonolingualCorpus
.FileBackedBilingualCorpus
that corresponds to the specified arguments.java.io.IOException
- if some sort of I/O error occurs.public static FileBackedBilingualCorpus getInstance(FileBackedMonolingualCorpus src, FileBackedMonolingualCorpus trg, CorpusAlignment alignments, LexicalWeighting srcToTrgWeights, LexicalWeighting trgToSrcWeights, Dictionary dictionary) throws java.io.IOException
FileBackedMonolingualCorpus
pair.src
- the source FileBackedMonolingualCorpus
to align.trg
- the target FileBackedMonolingualCorpus
to align.alignments
- the word alignment between the source and the target corpora.srcToTrgWeights
- the LexicalWeighting
from the source language to the target language.trgToSrcWeights
- the LexicalWeighting
from the target language to the source language.dictionary
- the Dictionary
which with to align entities.FileBackedBilingualCorpus
resulting from aligning the given FileBackedMonolingualCorpus
pair.java.io.IOException
- if some sort of I/O error occurs.public MonolingualCorpus getSrc()
BilingualCorpus
MonolingualCorpus
of this bilingual corpus.getSrc
in interface BilingualCorpus
MonolingualCorpus
of this bilingual corpus.public MonolingualCorpus getTrg()
BilingualCorpus
MonolingualCorpus
of this bilingual corpus.getTrg
in interface BilingualCorpus
MonolingualCorpus
of this bilingual corpus.public java.lang.String translate(java.util.List<PhraseElement> fragment, Dictionary dictionary, Generator generator, boolean tokenize)
BilingualCorpus
null
.translate
in interface BilingualCorpus
fragment
- the text fragment to translate.dictionary
- the Dictionary
with which to translate entities.generator
- the Generator
with which to inflect the translated entities.tokenize
- whether the resulting translation should be tokenized or not.null
if no match is found.