Index

A B C D E F G H I L M N O P R S T U V W _ 
All Classes and Interfaces|All Packages|Constant Field Values|Serialized Form

A

add(String) - Method in class com.optimaize.langdetect.cybozu.util.LangProfile
Deprecated.
Add n-gram to profile
add(String) - Method in class com.optimaize.langdetect.cybozu.util.TagExtractor
 
addChar(char) - Method in class com.optimaize.langdetect.cybozu.util.NGram
 
addCharSequence(LangProfile, CharSequence) - Static method in class com.optimaize.langdetect.cybozu.util.Util
 
addGram(String) - Method in class com.optimaize.langdetect.profiles.LanguageProfileBuilder
Shortcut for addGram(ngram, 1).
addGram(String, int) - Method in class com.optimaize.langdetect.profiles.LanguageProfileBuilder
If the builder already has this ngram, the given frequency is added to the current count.
addOpt(String, String, String) - Method in class com.optimaize.langdetect.cybozu.CommandLineInterface
 
addText(CharSequence) - Method in class com.optimaize.langdetect.profiles.LanguageProfileBuilder
In order to use this you must set the LanguageProfileBuilder.ngramExtractor first.
affixFactor(double) - Method in class com.optimaize.langdetect.LanguageDetectorBuilder
Sets prefixFactor() and suffixFactor() both to the given value.
alpha - Variable in class com.optimaize.langdetect.LanguageDetectorBuilder
 
alpha - Variable in class com.optimaize.langdetect.LanguageDetectorImpl
 
alpha(double) - Method in class com.optimaize.langdetect.LanguageDetectorBuilder
 
ALPHA_DEFAULT - Static variable in class com.optimaize.langdetect.LanguageDetectorBuilder
 
ALPHA_WIDTH - Static variable in class com.optimaize.langdetect.LanguageDetectorImpl
TODO document what this is for, and why that value is chosen.
append(char) - Method in class com.optimaize.langdetect.text.TextObject
 
append(Reader) - Method in class com.optimaize.langdetect.text.TextObject
Append the target text for language detection.
append(CharSequence) - Method in class com.optimaize.langdetect.text.TextObject
Append the target text for language detection.
append(CharSequence, int, int) - Method in class com.optimaize.langdetect.text.TextObject
 
applyPadding(CharSequence) - Method in class com.optimaize.langdetect.ngram.NgramExtractor
 
arglist - Variable in class com.optimaize.langdetect.cybozu.CommandLineInterface
 
assignLang(String) - Static method in class com.optimaize.langdetect.i18n.LdLocale
 

B

backwards() - Static method in class com.optimaize.langdetect.ngram.NgramExtractors
The old way of doing n-grams.
BACKWARDS - Static variable in class com.optimaize.langdetect.ngram.NgramExtractors
 
BackwardsCompatibleNgramFilter - Class in com.optimaize.langdetect.ngram
Filters those that were not generated by the old n-gram generator.
BackwardsCompatibleNgramFilter() - Constructor for class com.optimaize.langdetect.ngram.BackwardsCompatibleNgramFilter
 
BASE_FREQ - Static variable in class com.optimaize.langdetect.LanguageDetectorImpl
TODO document what this is for, and why that value is chosen.
batchTest() - Method in class com.optimaize.langdetect.cybozu.CommandLineInterface
Batch Test of Language Detection (--batchtest option)
buf_ - Variable in class com.optimaize.langdetect.cybozu.util.TagExtractor
 
build() - Method in class com.optimaize.langdetect.LanguageDetectorBuilder
 
build() - Method in class com.optimaize.langdetect.profiles.LanguageProfileBuilder
 
build() - Method in class com.optimaize.langdetect.text.TextObjectFactoryBuilder
 
BuiltInLanguages - Class in com.optimaize.langdetect.profiles
 
BuiltInLanguages() - Constructor for class com.optimaize.langdetect.profiles.BuiltInLanguages
 
BUNDLE_NAME - Static variable in class com.optimaize.langdetect.cybozu.util.Messages
 

C

capitalword_ - Variable in class com.optimaize.langdetect.cybozu.util.NGram
 
charAt(int) - Method in class com.optimaize.langdetect.text.TextObject
 
CharNormalizer - Class in com.optimaize.langdetect.cybozu.util
Some character normalization (and exclusion) functionality.
CharNormalizer() - Constructor for class com.optimaize.langdetect.cybozu.util.CharNormalizer
 
CharNormalizerTextFilterImpl - Class in com.optimaize.langdetect.text
Deprecated.
can't be used because it would be a big loss to not inline this code.
CharNormalizerTextFilterImpl() - Constructor for class com.optimaize.langdetect.text.CharNormalizerTextFilterImpl
Deprecated.
 
CJK_CLASS - Static variable in class com.optimaize.langdetect.cybozu.util.CharNormalizer
CJK Kanji Normalization Mapping
cjk_map - Static variable in class com.optimaize.langdetect.cybozu.util.CharNormalizer
 
clear() - Method in class com.optimaize.langdetect.cybozu.util.TagExtractor
 
closeQuietly(Closeable) - Static method in class com.optimaize.langdetect.frma.IOUtils
Deprecated.
use java7 closeable
closeTag(LangProfile) - Method in class com.optimaize.langdetect.cybozu.util.TagExtractor
 
com.optimaize.langdetect - package com.optimaize.langdetect
 
com.optimaize.langdetect.cybozu - package com.optimaize.langdetect.cybozu
Original language detection classes from https://code.google.com/p/language-detection/
com.optimaize.langdetect.cybozu.util - package com.optimaize.langdetect.cybozu.util
Provides the utility classes for language detection.
com.optimaize.langdetect.frma - package com.optimaize.langdetect.frma
 
com.optimaize.langdetect.i18n - package com.optimaize.langdetect.i18n
 
com.optimaize.langdetect.ngram - package com.optimaize.langdetect.ngram
Provides functionality for handling n-grams.
com.optimaize.langdetect.profiles - package com.optimaize.langdetect.profiles
Provides functionality for loading, storing and creating LanguageProfiles.
com.optimaize.langdetect.profiles.util - package com.optimaize.langdetect.profiles.util
 
com.optimaize.langdetect.text - package com.optimaize.langdetect.text
Provides functionality for concatenating and cleaning text that is used as a) learning text to produce
invalid reference
com.optimaize.langdetect.LanguageProfile
s b) for the text for which the language is to be guessed.
CommandLineInterface - Class in com.optimaize.langdetect.cybozu
LangDetect Command Line Interface.
CommandLineInterface() - Constructor for class com.optimaize.langdetect.cybozu.CommandLineInterface
 
CommonTextObjectFactories - Class in com.optimaize.langdetect.text
Contains some standard TextObjectFactorys ready to use for common use cases.
CommonTextObjectFactories() - Constructor for class com.optimaize.langdetect.text.CommonTextObjectFactories
 
compareTo(DetectedLanguage) - Method in class com.optimaize.langdetect.DetectedLanguage
See class header.
CONV_THRESHOLD - Static variable in class com.optimaize.langdetect.LanguageDetectorImpl
TODO document what this is for, and why that value is chosen.
convert(LangProfile) - Static method in class com.optimaize.langdetect.profiles.OldLangProfileConverter
 
count() - Method in class com.optimaize.langdetect.cybozu.util.TagExtractor
 
count_ - Variable in class com.optimaize.langdetect.cybozu.util.TagExtractor
 
countByScript(CharSequence) - Method in class com.optimaize.langdetect.text.RemoveMinorityScriptsTextFilter
 
create() - Method in class com.optimaize.langdetect.text.TextObjectFactory
 
create(NgramExtractor) - Static method in class com.optimaize.langdetect.LanguageDetectorBuilder
 
create(Collection<LanguageProfile>, Collection<Integer>) - Static method in class com.optimaize.langdetect.NgramFrequencyData
 

D

DEFAULT_ALPHA - Static variable in class com.optimaize.langdetect.cybozu.CommandLineInterface
smoothing default parameter (ELE)
DEFAULT_SEED - Static variable in class com.optimaize.langdetect.LanguageDetectorImpl
This is used when no custom seed was passed in.
detect(CharSequence) - Method in interface com.optimaize.langdetect.LanguageDetector
Returns the best detected language if the algorithm is very confident.
detect(CharSequence) - Method in class com.optimaize.langdetect.LanguageDetectorImpl
 
detectBlock(CharSequence) - Method in class com.optimaize.langdetect.LanguageDetectorImpl
 
detectBlockLongText(List<String>) - Method in class com.optimaize.langdetect.LanguageDetectorImpl
This is the original algorithm used for all text length.
detectBlockShortText(Map<String, Integer>) - Method in class com.optimaize.langdetect.LanguageDetectorImpl
 
DetectedLanguage - Class in com.optimaize.langdetect
Holds information about a detected language: the locale (language) and the probability.
DetectedLanguage(LdLocale, double) - Constructor for class com.optimaize.langdetect.DetectedLanguage
 
detectLang() - Method in class com.optimaize.langdetect.cybozu.CommandLineInterface
Language detection test for each file (--detectlang option)

E

equals(Object) - Method in class com.optimaize.langdetect.i18n.LdLocale
 
equals(Object) - Method in class com.optimaize.langdetect.profiles.LanguageProfileImpl
 
extractCountedGrams(CharSequence) - Method in class com.optimaize.langdetect.ngram.NgramExtractor
 
extractGrams(CharSequence) - Method in class com.optimaize.langdetect.ngram.NgramExtractor
Creates the n-grams for a given text in the order they occur.
extractNGrams(CharSequence, OldNgramExtractor.Filter) - Static method in class com.optimaize.langdetect.ngram.OldNgramExtractor
Deprecated.

F

filter - Variable in class com.optimaize.langdetect.ngram.NgramExtractor
 
filter(NgramFilter) - Method in class com.optimaize.langdetect.ngram.NgramExtractor
 
filter(CharSequence) - Method in class com.optimaize.langdetect.text.CharNormalizerTextFilterImpl
Deprecated.
 
filter(CharSequence) - Method in class com.optimaize.langdetect.text.MultiTextFilter
 
filter(CharSequence) - Method in class com.optimaize.langdetect.text.RemoveMinorityScriptsTextFilter
 
filter(CharSequence) - Method in interface com.optimaize.langdetect.text.TextFilter
 
filter(CharSequence) - Method in class com.optimaize.langdetect.text.UrlTextFilter
 
filters - Variable in class com.optimaize.langdetect.text.MultiTextFilter
 
findMost(Map<Character.UnicodeScript, Long>) - Method in class com.optimaize.langdetect.text.RemoveMinorityScriptsTextFilter
 
forDetectingOnLargeText() - Static method in class com.optimaize.langdetect.text.CommonTextObjectFactories
 
forDetectingShortCleanText() - Static method in class com.optimaize.langdetect.text.CommonTextObjectFactories
 
forIndexing() - Static method in class com.optimaize.langdetect.text.CommonTextObjectFactories
 
forIndexingCleanText() - Static method in class com.optimaize.langdetect.text.CommonTextObjectFactories
 
forText(CharSequence) - Method in class com.optimaize.langdetect.text.TextObjectFactory
 
forThreshold(double) - Static method in class com.optimaize.langdetect.text.RemoveMinorityScriptsTextFilter
If a script has less than this fraction of content compared to the most used one, its text is removed.
freq - Variable in class com.optimaize.langdetect.cybozu.util.LangProfile
Deprecated.
Key = ngram, value = count.
FREQ_PATTERN - Static variable in class com.optimaize.langdetect.frma.LangProfileReader
 
fromString(String) - Static method in class com.optimaize.langdetect.i18n.LdLocale
 

G

generate(String, File) - Static method in class com.optimaize.langdetect.frma.GenProfile
Loads a text file and generate a language profile from its content.
generateProfile() - Method in class com.optimaize.langdetect.cybozu.CommandLineInterface
Generate Language Profile from a text file.
GenProfile - Class in com.optimaize.langdetect.cybozu
Load Wikipedia's abstract XML as corpus and generate its language profile in JSON format.
GenProfile - Class in com.optimaize.langdetect.frma
Generate a language profile from any given text file.
GenProfile() - Constructor for class com.optimaize.langdetect.cybozu.GenProfile
 
GenProfile() - Constructor for class com.optimaize.langdetect.frma.GenProfile
 
get(int) - Method in class com.optimaize.langdetect.cybozu.util.NGram
TODO this method has some weird, undocumented behavior to ignore ngrams with upper case.
getFreq() - Method in class com.optimaize.langdetect.cybozu.util.LangProfile
Deprecated.
 
getFrequency(String) - Method in interface com.optimaize.langdetect.profiles.LanguageProfile
 
getFrequency(String) - Method in class com.optimaize.langdetect.profiles.LanguageProfileImpl
 
getGramLengths() - Method in class com.optimaize.langdetect.ngram.NgramExtractor
 
getGramLengths() - Method in interface com.optimaize.langdetect.profiles.LanguageProfile
Tells what the n in n-grams are used here.
getGramLengths() - Method in class com.optimaize.langdetect.profiles.LanguageProfileImpl
 
getInstance() - Static method in class com.optimaize.langdetect.ngram.BackwardsCompatibleNgramFilter
 
getInstance() - Static method in class com.optimaize.langdetect.ngram.StandardNgramFilter
 
getInstance() - Static method in class com.optimaize.langdetect.text.UrlTextFilter
 
getLanguage() - Method in class com.optimaize.langdetect.i18n.LdLocale
 
getLanguage(int) - Method in class com.optimaize.langdetect.NgramFrequencyData
 
getLanguageList() - Method in class com.optimaize.langdetect.NgramFrequencyData
 
getLanguages() - Static method in class com.optimaize.langdetect.profiles.BuiltInLanguages
 
getLocale() - Method in class com.optimaize.langdetect.DetectedLanguage
 
getLocale() - Method in interface com.optimaize.langdetect.profiles.LanguageProfile
 
getLocale() - Method in class com.optimaize.langdetect.profiles.LanguageProfileImpl
 
getMaxGramCount(int) - Method in interface com.optimaize.langdetect.profiles.LanguageProfile
Tells how often the n-gram with the highest amount of occurrences used in this profile occurred.
getMaxGramCount(int) - Method in class com.optimaize.langdetect.profiles.LanguageProfileImpl
 
getMinGramCount(int) - Method in interface com.optimaize.langdetect.profiles.LanguageProfile
Tells how often the n-gram with the lowest amount of occurrences used in this profile occurred.
getMinGramCount(int) - Method in class com.optimaize.langdetect.profiles.LanguageProfileImpl
 
getName() - Method in class com.optimaize.langdetect.cybozu.util.LangProfile
Deprecated.
 
getNumGramOccurrences(int) - Method in interface com.optimaize.langdetect.profiles.LanguageProfile
Tells how often all n-grams of a certain length occurred, combined.
getNumGramOccurrences(int) - Method in class com.optimaize.langdetect.profiles.LanguageProfileImpl
 
getNumGrams() - Method in interface com.optimaize.langdetect.profiles.LanguageProfile
Tells how many n-grams there are for all n-gram sizes combined.
getNumGrams() - Method in class com.optimaize.langdetect.profiles.LanguageProfileImpl
 
getNumGrams(int) - Method in interface com.optimaize.langdetect.profiles.LanguageProfile
Tells how many different n-grams there are for a certain n-gram size.
getNumGrams(int) - Method in class com.optimaize.langdetect.profiles.LanguageProfileImpl
 
getNWords() - Method in class com.optimaize.langdetect.cybozu.util.LangProfile
Deprecated.
 
getParamDouble(String, double) - Method in class com.optimaize.langdetect.cybozu.CommandLineInterface
Returns the double, or the default is absent.
getParamLongOrNull(String) - Method in class com.optimaize.langdetect.cybozu.CommandLineInterface
 
getProbabilities(CharSequence) - Method in interface com.optimaize.langdetect.LanguageDetector
Returns all languages with at least some likeliness.
getProbabilities(CharSequence) - Method in class com.optimaize.langdetect.LanguageDetectorImpl
 
getProbabilities(String) - Method in class com.optimaize.langdetect.NgramFrequencyData
Don't modify this data structure! (Can't make array immutable...)
getProbability() - Method in class com.optimaize.langdetect.DetectedLanguage
 
getRegion() - Method in class com.optimaize.langdetect.i18n.LdLocale
 
getScript() - Method in class com.optimaize.langdetect.i18n.LdLocale
 
getShortTextLanguages() - Static method in class com.optimaize.langdetect.profiles.BuiltInLanguages
 
getString(String) - Static method in class com.optimaize.langdetect.cybozu.util.Messages
 
gramLength(int) - Static method in class com.optimaize.langdetect.ngram.NgramExtractor
 
gramLengths - Variable in class com.optimaize.langdetect.ngram.NgramExtractor
 
gramLengths(Integer...) - Static method in class com.optimaize.langdetect.ngram.NgramExtractor
 
grams_ - Variable in class com.optimaize.langdetect.cybozu.util.NGram
 
guessNumDistinctiveGrams(int, int) - Static method in class com.optimaize.langdetect.ngram.NgramExtractor
This is trying to be smart.

H

hashCode() - Method in class com.optimaize.langdetect.i18n.LdLocale
 
hashCode() - Method in class com.optimaize.langdetect.profiles.LanguageProfileImpl
 
hasParam(String) - Method in class com.optimaize.langdetect.cybozu.CommandLineInterface
 

I

increment(Map<Character.UnicodeScript, Long>, Character.UnicodeScript) - Method in class com.optimaize.langdetect.text.RemoveMinorityScriptsTextFilter
 
initProbability() - Method in class com.optimaize.langdetect.LanguageDetectorImpl
Initialize the map of language probabilities.
INSTANCE - Static variable in class com.optimaize.langdetect.ngram.BackwardsCompatibleNgramFilter
 
INSTANCE - Static variable in class com.optimaize.langdetect.ngram.StandardNgramFilter
 
INSTANCE - Static variable in class com.optimaize.langdetect.text.UrlTextFilter
 
internalReader - Static variable in class com.optimaize.langdetect.profiles.LanguageProfileReader
 
IOUtils - Class in com.optimaize.langdetect.frma
Deprecated.
IOUtils() - Constructor for class com.optimaize.langdetect.frma.IOUtils
Deprecated.
Private constructor to prevent instantiation.
isSpace() - Method in class com.optimaize.langdetect.cybozu.util.TagExtractor
 
iterateGrams() - Method in interface com.optimaize.langdetect.profiles.LanguageProfile
Iterates all ngram strings with frequency.
iterateGrams() - Method in class com.optimaize.langdetect.profiles.LanguageProfileImpl
 
iterateGrams(int) - Method in interface com.optimaize.langdetect.profiles.LanguageProfile
Iterates all gramLength-gram strings with frequency.
iterateGrams(int) - Method in class com.optimaize.langdetect.profiles.LanguageProfileImpl
 
ITERATION_LIMIT - Static variable in class com.optimaize.langdetect.LanguageDetectorImpl
TODO document what this is for, and why that value is chosen.

L

langlist - Variable in class com.optimaize.langdetect.NgramFrequencyData
All the loaded languages, in exactly the same order as the data is in the double[] in wordLangProbMap.
LangProfile - Class in com.optimaize.langdetect.cybozu.util
Deprecated.
replaced by LanguageProfile
LangProfile() - Constructor for class com.optimaize.langdetect.cybozu.util.LangProfile
Deprecated.
Constructor for JSONIC
LangProfile(String) - Constructor for class com.optimaize.langdetect.cybozu.util.LangProfile
Deprecated.
Normal Constructor
LangProfileReader - Class in com.optimaize.langdetect.frma
Reads LangProfiles.
LangProfileReader() - Constructor for class com.optimaize.langdetect.frma.LangProfileReader
 
LangProfileWriter - Class in com.optimaize.langdetect.frma
Writes a LangProfile to an output stream (file).
LangProfileWriter() - Constructor for class com.optimaize.langdetect.frma.LangProfileWriter
 
langsAdded - Variable in class com.optimaize.langdetect.LanguageDetectorBuilder
 
language - Variable in class com.optimaize.langdetect.i18n.LdLocale
 
LanguageDetector - Interface in com.optimaize.langdetect
Guesses the language of an input string or text.
LanguageDetectorBuilder - Class in com.optimaize.langdetect
Builder for LanguageDetector.
LanguageDetectorBuilder(NgramExtractor) - Constructor for class com.optimaize.langdetect.LanguageDetectorBuilder
 
LanguageDetectorImpl - Class in com.optimaize.langdetect
This class is immutable and thus thread-safe.
LanguageDetectorImpl(NgramFrequencyData, double, Optional<Long>, int, double, double, double, double, Map<LdLocale, Double>, NgramExtractor) - Constructor for class com.optimaize.langdetect.LanguageDetectorImpl
LanguageLister - Class in com.optimaize.langdetect.profiles.util
This is just a utility to update the code with the existing languages.
LanguageLister() - Constructor for class com.optimaize.langdetect.profiles.util.LanguageLister
 
languagePriorities(Map<LdLocale, Double>) - Method in class com.optimaize.langdetect.LanguageDetectorBuilder
TODO document exactly.
LanguageProfile - Interface in com.optimaize.langdetect.profiles
A language profile knows the locale (language), and contains the n-grams and some statistics.
LanguageProfileBuilder - Class in com.optimaize.langdetect.profiles
Builder for LanguageProfile.
LanguageProfileBuilder(LdLocale) - Constructor for class com.optimaize.langdetect.profiles.LanguageProfileBuilder
 
LanguageProfileBuilder(String) - Constructor for class com.optimaize.langdetect.profiles.LanguageProfileBuilder
Deprecated.
LanguageProfileImpl - Class in com.optimaize.langdetect.profiles
This class is immutable.
LanguageProfileImpl(LdLocale, Map<Integer, Map<String, Integer>>) - Constructor for class com.optimaize.langdetect.profiles.LanguageProfileImpl
Use the builder.
LanguageProfileImpl.Stats - Class in com.optimaize.langdetect.profiles
 
LanguageProfileReader - Class in com.optimaize.langdetect.profiles
LanguageProfileReader() - Constructor for class com.optimaize.langdetect.profiles.LanguageProfileReader
 
languageProfiles - Variable in class com.optimaize.langdetect.LanguageDetectorBuilder
 
LanguageProfileWriter - Class in com.optimaize.langdetect.profiles
Writes a LanguageProfile to an output stream or file.
LanguageProfileWriter() - Constructor for class com.optimaize.langdetect.profiles.LanguageProfileWriter
 
languages - Static variable in class com.optimaize.langdetect.profiles.BuiltInLanguages
 
langWeightingMap - Variable in class com.optimaize.langdetect.LanguageDetectorBuilder
 
LATIN1_EXCLUDED - Static variable in class com.optimaize.langdetect.cybozu.util.CharNormalizer
 
LdLocale - Class in com.optimaize.langdetect.i18n
A language-detector implementation of a Locale, similar to the java.util.Locale.
LdLocale(String, Optional<String>, Optional<String>) - Constructor for class com.optimaize.langdetect.i18n.LdLocale
 
length() - Method in class com.optimaize.langdetect.text.TextObject
 
LESS_FREQ_RATIO - Static variable in class com.optimaize.langdetect.cybozu.util.LangProfile
Deprecated.
Explanation by example: If the most frequent n-gram occurs 1 mio times, then 1'000'000 / this (100'000) = 10.
load(String, File) - Static method in class com.optimaize.langdetect.cybozu.GenProfile
Load Wikipedia abstract database file and generate its language profile
locale - Variable in class com.optimaize.langdetect.DetectedLanguage
 
locale - Variable in class com.optimaize.langdetect.profiles.LanguageProfileBuilder
 
locale - Variable in class com.optimaize.langdetect.profiles.LanguageProfileImpl
 
logger - Static variable in class com.optimaize.langdetect.cybozu.GenProfile
 
logger - Static variable in class com.optimaize.langdetect.LanguageDetectorImpl
 
looksLikeGeoCode3166_1(String) - Static method in class com.optimaize.langdetect.i18n.LdLocale
 
looksLikeGeoCodeNumeric(String) - Static method in class com.optimaize.langdetect.i18n.LdLocale
 
looksLikeLanguageProfileFile(File) - Method in class com.optimaize.langdetect.profiles.LanguageProfileReader
 
looksLikeLanguageProfileName(String) - Method in class com.optimaize.langdetect.profiles.LanguageProfileReader
 
looksLikeScriptCode(String) - Static method in class com.optimaize.langdetect.i18n.LdLocale
 

M

MAIL_REGEX - Static variable in class com.optimaize.langdetect.text.UrlTextFilter
 
main(String[]) - Static method in class com.optimaize.langdetect.cybozu.CommandLineInterface
Command Line Interface
main(String[]) - Static method in class com.optimaize.langdetect.profiles.util.LanguageLister
 
makeDetector() - Method in class com.optimaize.langdetect.cybozu.CommandLineInterface
Using all language profiles from the given directory.
makeInternalPrioMap(Map<LdLocale, Double>, List<LdLocale>) - Static method in class com.optimaize.langdetect.cybozu.util.Util
 
makePathForClassLoader(String, String) - Method in class com.optimaize.langdetect.profiles.LanguageProfileReader
 
makeProfileFileName(LdLocale) - Method in class com.optimaize.langdetect.profiles.LanguageProfileReader
 
makeStats(Map<Integer, Map<String, Integer>>) - Static method in class com.optimaize.langdetect.profiles.LanguageProfileImpl
 
maxGramCounts - Variable in class com.optimaize.langdetect.profiles.LanguageProfileImpl.Stats
Key = gram length (1-3 or so).
maxTextLength - Variable in class com.optimaize.langdetect.text.TextObject
 
maxTextLength - Variable in class com.optimaize.langdetect.text.TextObjectFactory
 
maxTextLength - Variable in class com.optimaize.langdetect.text.TextObjectFactoryBuilder
 
maxTextLength(int) - Method in class com.optimaize.langdetect.text.TextObjectFactoryBuilder
 
Messages - Class in com.optimaize.langdetect.cybozu.util
This is Messages class generated by Eclipse automatically.
Messages() - Constructor for class com.optimaize.langdetect.cybozu.util.Messages
 
minGramCounts - Variable in class com.optimaize.langdetect.profiles.LanguageProfileImpl.Stats
Key = gram length (1-3 or so).
minimalConfidence - Variable in class com.optimaize.langdetect.LanguageDetectorBuilder
 
minimalConfidence - Variable in class com.optimaize.langdetect.LanguageDetectorImpl
 
minimalConfidence(double) - Method in class com.optimaize.langdetect.LanguageDetectorBuilder
LanguageDetector.detect(java.lang.CharSequence) returns a language if the best detected language has at least this probability.
minimalFrequency - Variable in class com.optimaize.langdetect.profiles.LanguageProfileBuilder
 
minimalFrequency(int) - Method in class com.optimaize.langdetect.profiles.LanguageProfileBuilder
 
MINIMUM_FREQ - Static variable in class com.optimaize.langdetect.cybozu.util.LangProfile
Deprecated.
n-grams that occur less than this often can be removed using omitLessFreq().
MultiTextFilter - Class in com.optimaize.langdetect.text
Groups multiple TextFilters as one and runs them in the given order.
MultiTextFilter(List<TextFilter>) - Constructor for class com.optimaize.langdetect.text.MultiTextFilter
 

N

N_GRAM - Static variable in class com.optimaize.langdetect.cybozu.util.NGram
ngrams are created from 1gram to this amount, currently 2grams and 3grams.
N_TRIAL - Static variable in class com.optimaize.langdetect.LanguageDetectorImpl
TODO document what this is for, and why that value is chosen.
N_WORDS_PATTERN - Static variable in class com.optimaize.langdetect.frma.LangProfileReader
 
name - Variable in class com.optimaize.langdetect.cybozu.util.LangProfile
Deprecated.
The language name (identifier).
NAME_PATTERN - Static variable in class com.optimaize.langdetect.frma.LangProfileReader
 
NGram - Class in com.optimaize.langdetect.cybozu.util
TODO document.
NGram() - Constructor for class com.optimaize.langdetect.cybozu.util.NGram
 
ngramExtractor - Static variable in class com.optimaize.langdetect.cybozu.util.Util
 
ngramExtractor - Variable in class com.optimaize.langdetect.LanguageDetectorBuilder
 
ngramExtractor - Variable in class com.optimaize.langdetect.LanguageDetectorImpl
 
ngramExtractor - Variable in class com.optimaize.langdetect.profiles.LanguageProfileBuilder
 
ngramExtractor(NgramExtractor) - Method in class com.optimaize.langdetect.profiles.LanguageProfileBuilder
 
NgramExtractor - Class in com.optimaize.langdetect.ngram
Class for extracting n-grams out of a text.
NgramExtractor(List<Integer>, NgramFilter, Character) - Constructor for class com.optimaize.langdetect.ngram.NgramExtractor
 
NgramExtractors - Class in com.optimaize.langdetect.ngram
Provides easy access to commonly used NgramExtractor configs.
NgramExtractors() - Constructor for class com.optimaize.langdetect.ngram.NgramExtractors
 
NgramFilter - Interface in com.optimaize.langdetect.ngram
Filters out some undesired n-grams.
ngramFrequencyData - Variable in class com.optimaize.langdetect.LanguageDetectorImpl
 
NgramFrequencyData - Class in com.optimaize.langdetect
Contains frequency information for n-grams coming from multiple LanguageProfiles.
NgramFrequencyData(Map<String, double[]>, List<LdLocale>) - Constructor for class com.optimaize.langdetect.NgramFrequencyData
 
ngrams - Variable in class com.optimaize.langdetect.profiles.LanguageProfileBuilder
 
ngrams - Variable in class com.optimaize.langdetect.profiles.LanguageProfileImpl
 
normalize(char) - Static method in class com.optimaize.langdetect.cybozu.util.CharNormalizer
Character Normalization (and exclusion).
normalizeProb(double[]) - Static method in class com.optimaize.langdetect.cybozu.util.Util
normalize probabilities and check convergence by the maximum probability
numOccurrences - Variable in class com.optimaize.langdetect.profiles.LanguageProfileImpl.Stats
Key = gram length (1-3 or so).
nWords - Variable in class com.optimaize.langdetect.cybozu.util.LangProfile
Deprecated.
Tells how many occurrences of n-grams exist per gram length.

O

OldLangProfileConverter - Class in com.optimaize.langdetect.profiles
Converts an old LangProfile to a new LanguageProfile.
OldLangProfileConverter() - Constructor for class com.optimaize.langdetect.profiles.OldLangProfileConverter
 
OldNgramExtractor - Class in com.optimaize.langdetect.ngram
Deprecated.
OldNgramExtractor() - Constructor for class com.optimaize.langdetect.ngram.OldNgramExtractor
Deprecated.
 
OldNgramExtractor.Filter - Interface in com.optimaize.langdetect.ngram
Deprecated.
 
omitLessFreq() - Method in class com.optimaize.langdetect.cybozu.util.LangProfile
Deprecated.
Removes ngrams that occur fewer times than MINIMUM_FREQ to get rid of rare ngrams.
opt_with_value - Variable in class com.optimaize.langdetect.cybozu.CommandLineInterface
for Command line easy parser
opt_without_value - Variable in class com.optimaize.langdetect.cybozu.CommandLineInterface
 

P

parse(String[]) - Method in class com.optimaize.langdetect.cybozu.CommandLineInterface
Command line easy parser
prefixFactor - Variable in class com.optimaize.langdetect.LanguageDetectorBuilder
 
prefixFactor - Variable in class com.optimaize.langdetect.LanguageDetectorImpl
 
prefixFactor(double) - Method in class com.optimaize.langdetect.LanguageDetectorBuilder
To weight n-grams that are on the left border of a word differently from n-grams in the middle of words, assign a value here.
priorMap - Variable in class com.optimaize.langdetect.LanguageDetectorImpl
User-defined language priorities, in the same order as langlist.
probability - Variable in class com.optimaize.langdetect.DetectedLanguage
 
PROBABILITY_SORTING_COMPARATOR - Static variable in class com.optimaize.langdetect.LanguageDetectorImpl
 
probabilityThreshold - Variable in class com.optimaize.langdetect.LanguageDetectorBuilder
 
probabilityThreshold - Variable in class com.optimaize.langdetect.LanguageDetectorImpl
 
probabilityThreshold(double) - Method in class com.optimaize.langdetect.LanguageDetectorBuilder
LanguageDetector.getProbabilities(java.lang.CharSequence) does not return languages with less probability than this.
PROFILES_DIR - Static variable in class com.optimaize.langdetect.profiles.LanguageProfileReader
 

R

read(File) - Method in class com.optimaize.langdetect.frma.LangProfileReader
Reads a LangProfile from a File in UTF-8.
read(File) - Method in class com.optimaize.langdetect.profiles.LanguageProfileReader
Reads a LanguageProfile from a File in UTF-8.
read(InputStream) - Method in class com.optimaize.langdetect.frma.LangProfileReader
Reads a LangProfile from an InputStream in UTF-8.
read(InputStream) - Method in class com.optimaize.langdetect.profiles.LanguageProfileReader
Reads a LanguageProfile from an InputStream in UTF-8.
read(ClassLoader, String, Collection<String>) - Method in class com.optimaize.langdetect.profiles.LanguageProfileReader
Load profiles from the classpath in a specific directory.
read(String, Collection<String>) - Method in class com.optimaize.langdetect.profiles.LanguageProfileReader
Same as LanguageProfileReader.read(ClassLoader, String, java.util.Collection) using the class loader of this class.
read(Collection<String>) - Method in class com.optimaize.langdetect.profiles.LanguageProfileReader
Same as LanguageProfileReader.read(ClassLoader, String, java.util.Collection) using the class loader of this class, and the default profiles directory of this library.
readAll() - Method in class com.optimaize.langdetect.profiles.LanguageProfileReader
Deprecated.
renamed to readAllBuiltIn()
readAll(File) - Method in class com.optimaize.langdetect.profiles.LanguageProfileReader
Loads all profiles from the specified directory.
readAllBuiltIn() - Method in class com.optimaize.langdetect.profiles.LanguageProfileReader
Reads all built-in language profiles from the "languages" folder (shipped with the jar).
readBuiltIn(LdLocale) - Method in class com.optimaize.langdetect.profiles.LanguageProfileReader
 
readBuiltIn(Collection<LdLocale>) - Method in class com.optimaize.langdetect.profiles.LanguageProfileReader
 
readFilesFromClassPathFolder(String) - Static method in class com.optimaize.langdetect.profiles.util.LanguageLister
 
region - Variable in class com.optimaize.langdetect.i18n.LdLocale
 
remove(CharSequence, Set<Character.UnicodeScript>) - Method in class com.optimaize.langdetect.text.RemoveMinorityScriptsTextFilter
 
RemoveMinorityScriptsTextFilter - Class in com.optimaize.langdetect.text
Removes text written in scripts that are not the dominant script of the text.
RemoveMinorityScriptsTextFilter(double) - Constructor for class com.optimaize.langdetect.text.RemoveMinorityScriptsTextFilter
 
removeNgramsWithLessFrequency() - Method in class com.optimaize.langdetect.profiles.LanguageProfileBuilder
 
requireParamString(String) - Method in class com.optimaize.langdetect.cybozu.CommandLineInterface
 
RESOURCE_BUNDLE - Static variable in class com.optimaize.langdetect.cybozu.util.Messages
 

S

script - Variable in class com.optimaize.langdetect.i18n.LdLocale
 
searchFile(File, String) - Method in class com.optimaize.langdetect.cybozu.CommandLineInterface
File search (easy glob)
seed - Variable in class com.optimaize.langdetect.LanguageDetectorBuilder
 
seed - Variable in class com.optimaize.langdetect.LanguageDetectorImpl
 
seed(long) - Method in class com.optimaize.langdetect.LanguageDetectorBuilder
 
seed(Optional<Long>) - Method in class com.optimaize.langdetect.LanguageDetectorBuilder
 
serialVersionUID - Static variable in class com.optimaize.langdetect.cybozu.util.LangProfile
Deprecated.
 
setFreq(Map<String, Integer>) - Method in class com.optimaize.langdetect.cybozu.util.LangProfile
Deprecated.
 
setName(String) - Method in class com.optimaize.langdetect.cybozu.util.LangProfile
Deprecated.
 
setNWords(int[]) - Method in class com.optimaize.langdetect.cybozu.util.LangProfile
Deprecated.
 
setTag(String) - Method in class com.optimaize.langdetect.cybozu.util.TagExtractor
 
shortTextAlgorithm - Variable in class com.optimaize.langdetect.LanguageDetectorBuilder
 
shortTextAlgorithm - Variable in class com.optimaize.langdetect.LanguageDetectorImpl
 
shortTextAlgorithm(int) - Method in class com.optimaize.langdetect.LanguageDetectorBuilder
Defaults to 0, which means don't use this feature.
shortTextLanguages - Static variable in class com.optimaize.langdetect.profiles.BuiltInLanguages
 
sortProbability(double[]) - Method in class com.optimaize.langdetect.LanguageDetectorImpl
Returns the detected languages sorted by probabilities descending.
standard() - Static method in class com.optimaize.langdetect.ngram.NgramExtractors
The new standard n-gram algorithm.
STANDARD - Static variable in class com.optimaize.langdetect.ngram.NgramExtractors
 
StandardNgramFilter - Class in com.optimaize.langdetect.ngram
Filters what is generally not desired.
StandardNgramFilter() - Constructor for class com.optimaize.langdetect.ngram.StandardNgramFilter
 
stats - Variable in class com.optimaize.langdetect.profiles.LanguageProfileImpl
 
Stats(Map<Integer, Long>, Map<Integer, Long>, Map<Integer, Long>) - Constructor for class com.optimaize.langdetect.profiles.LanguageProfileImpl.Stats
 
stringBuilder - Variable in class com.optimaize.langdetect.text.TextObject
 
subSequence(int, int) - Method in class com.optimaize.langdetect.text.TextObject
 
suffixFactor - Variable in class com.optimaize.langdetect.LanguageDetectorBuilder
 
suffixFactor - Variable in class com.optimaize.langdetect.LanguageDetectorImpl
 
suffixFactor(double) - Method in class com.optimaize.langdetect.LanguageDetectorBuilder
Defaults to 1.0, which means don't use this feature.

T

tag_ - Variable in class com.optimaize.langdetect.cybozu.util.TagExtractor
 
TagExtractor - Class in com.optimaize.langdetect.cybozu.util
TagExtractor is a class which extracts inner texts of specified tag.
TagExtractor(String, int) - Constructor for class com.optimaize.langdetect.cybozu.util.TagExtractor
 
target_ - Variable in class com.optimaize.langdetect.cybozu.util.TagExtractor
 
textFilter - Variable in class com.optimaize.langdetect.text.TextObject
 
textFilter - Variable in class com.optimaize.langdetect.text.TextObjectFactory
 
TextFilter - Interface in com.optimaize.langdetect.text
Allows to filter content from a text to be ignored for the n-gram analysis.
textFilters - Variable in class com.optimaize.langdetect.text.TextObjectFactoryBuilder
 
TextObject - Class in com.optimaize.langdetect.text
A convenient text object implementing CharSequence and Appendable.
TextObject(TextFilter, int) - Constructor for class com.optimaize.langdetect.text.TextObject
 
textObjectFactory - Static variable in class com.optimaize.langdetect.cybozu.util.TagExtractor
 
textObjectFactory - Static variable in class com.optimaize.langdetect.frma.GenProfile
 
TextObjectFactory - Class in com.optimaize.langdetect.text
Factory for TextObjects.
TextObjectFactory(TextFilter, int) - Constructor for class com.optimaize.langdetect.text.TextObjectFactory
 
TextObjectFactoryBuilder - Class in com.optimaize.langdetect.text
Builder for TextObjectFactory.
TextObjectFactoryBuilder() - Constructor for class com.optimaize.langdetect.text.TextObjectFactoryBuilder
 
textPadding - Variable in class com.optimaize.langdetect.ngram.NgramExtractor
 
textPadding(char) - Method in class com.optimaize.langdetect.ngram.NgramExtractor
To ensure having border grams, this character is added to the left and right of the text.
threshold - Variable in class com.optimaize.langdetect.text.RemoveMinorityScriptsTextFilter
 
threshold_ - Variable in class com.optimaize.langdetect.cybozu.util.TagExtractor
 
toString() - Method in class com.optimaize.langdetect.DetectedLanguage
 
toString() - Method in class com.optimaize.langdetect.i18n.LdLocale
The output of this can be fed to the fromString() method.
toString() - Method in class com.optimaize.langdetect.profiles.LanguageProfileImpl
 
toString() - Method in class com.optimaize.langdetect.text.TextObject
 

U

unicodeEncode(String) - Static method in class com.optimaize.langdetect.cybozu.util.Util
unicode encoding (for verbose mode)
updateLangProb(double[], String, int, double) - Method in class com.optimaize.langdetect.LanguageDetectorImpl
update language probabilities with N-gram string(N=1,2,3)
URL_REGEX - Static variable in class com.optimaize.langdetect.text.UrlTextFilter
 
UrlTextFilter - Class in com.optimaize.langdetect.text
Removes URLs and email addresses from the text.
UrlTextFilter() - Constructor for class com.optimaize.langdetect.text.UrlTextFilter
 
use(String) - Method in class com.optimaize.langdetect.ngram.BackwardsCompatibleNgramFilter
 
use(String) - Method in interface com.optimaize.langdetect.ngram.NgramFilter
 
use(String) - Method in interface com.optimaize.langdetect.ngram.OldNgramExtractor.Filter
Deprecated.
Allows to skip some n-grams.
use(String) - Method in class com.optimaize.langdetect.ngram.StandardNgramFilter
 
Util - Class in com.optimaize.langdetect.cybozu.util
A place for sharing code.
Util() - Constructor for class com.optimaize.langdetect.cybozu.util.Util
 

V

values - Variable in class com.optimaize.langdetect.cybozu.CommandLineInterface
 

W

withProfile(LanguageProfile) - Method in class com.optimaize.langdetect.LanguageDetectorBuilder
 
withProfiles(Iterable<LanguageProfile>) - Method in class com.optimaize.langdetect.LanguageDetectorBuilder
 
withTextFilter(TextFilter) - Method in class com.optimaize.langdetect.text.TextObjectFactoryBuilder
Adds the given TextFilter to be run on TextObject.append(java.io.Reader) methods.
wordLangProbMap - Variable in class com.optimaize.langdetect.NgramFrequencyData
Key = ngram Value = array with probabilities per loaded language, in the same order as langlist.
wordProbToString(double[], List<LdLocale>) - Static method in class com.optimaize.langdetect.cybozu.util.Util
 
write(LangProfile, OutputStream) - Method in class com.optimaize.langdetect.frma.LangProfileWriter
Writes a LangProfile to an OutputStream in UTF-8.
write(LanguageProfile, OutputStream) - Method in class com.optimaize.langdetect.profiles.LanguageProfileWriter
Writes a LanguageProfile to an OutputStream in UTF-8.
writeToDirectory(LanguageProfile, File) - Method in class com.optimaize.langdetect.profiles.LanguageProfileWriter
Writes a LanguageProfile to a folder using the language name as the file name.

_

_extractCounted(CharSequence, int, int, Map<String, Integer>) - Method in class com.optimaize.langdetect.ngram.NgramExtractor
 
A B C D E F G H I L M N O P R S T U V W _ 
All Classes and Interfaces|All Packages|Constant Field Values|Serialized Form