Apache Solr Language Identifier


Apache Solr Language Identifier

This module is intended to be used while indexing documents. It is implemented as an UpdateProcessor to be placed in an UpdateChain. Its purpose is to identify language from documents and tag the document with language code.

Compile priklausomybės (94)

Grupė / Artefaktas Versija Naujesnė Versija
dom4j » dom4j 1.6.1 1.4-dev-8
org.apache.xmlbeans » xmlbeans 2.6.0 5.0.1
org.apache.lucene » lucene-suggest 4.9.1 9.9.1
org.apache.lucene » lucene-spatial 4.9.1 7.7.3
net.arnx » jsonic 1.2.7 1.3.10
org.apache.solr » solr-core 4.9.1 9.6.0
org.apache.hadoop » hadoop-annotations 2.2.0 3.3.1
org.apache.solr » solr-solrj 4.9.1 9.6.0
org.apache.httpcomponents » httpclient 4.3.1 4.5.11
org.apache.hadoop » hadoop-hdfs 2.2.0 3.3.1
org.apache.httpcomponents » httpmime 4.3.1 4.5.12
org.apache.hadoop » hadoop-common 2.2.0 3.3.1
org.apache.hadoop » hadoop-auth 2.2.0 3.3.1
commons-io » commons-io 2.3 2.11.0
commons-lang » commons-lang 2.6 NA
org.eclipse.jetty » jetty-deploy 8.1.10.v20130312 10.0.6
xerces » xercesImpl 2.9.1 RELEASE
com.googlecode.juniversalchardet » juniversalchardet 1.0.3 NA
org.eclipse.jetty.orbit » javax.servlet 3.0.0.v201112011016 NA
org.eclipse.jetty » jetty-xml 8.1.10.v20130312 9.4.44.v20210927
org.eclipse.jetty » jetty-webapp 8.1.10.v20130312 9.4.44.v20210927
rome » rome 0.9 1.0
org.noggit » noggit 0.5 0.8
org.apache.httpcomponents » httpcore 4.3 4.4.15
org.slf4j » slf4j-api 1.7.6 2.0.12
org.antlr » antlr-runtime 3.5 3.5.2
com.uwyn » jhighlight 1.0 NA
com.googlecode.mp4parser » isoparser 1.0-RC-1 1.1.22
org.ow2.asm » asm-commons 4.1 9.2
jdom » jdom 1.0 1.1
com.google.guava » guava 14.0.1 33.0.0-jre
org.slf4j » slf4j-log4j12 1.7.6 2.0.12
org.ow2.asm » asm 4.1 9.2
org.apache.tika » tika-xmp 1.5 1.27
org.apache.tika » tika-parsers 1.5 1.27
org.apache.tika » tika-core 1.5 1.27
com.cybozu.labs » langdetect 1.1-20120112 NA
joda-time » joda-time 2.2 2.12.7
org.apache.lucene » lucene-analyzers-common 4.9.1 8.10.1
com.drewnoakes » metadata-extractor 2.6.2 2.16.0
commons-cli » commons-cli 1.2 1.4
org.aspectj » aspectjrt 1.6.11 1.9.21.2
org.apache.lucene » lucene-analyzers-phonetic 4.9.1 8.10.1
org.apache.lucene » lucene-analyzers-kuromoji 4.9.1 8.10.1
org.apache.lucene » lucene-grouping 4.9.1 9.9.1
log4j » log4j 1.2.17 NA
org.apache.lucene » lucene-highlighter 4.9.1 9.9.1
org.apache.lucene » lucene-expressions 4.9.1 9.9.1
org.apache.lucene » lucene-core 4.9.1 9.9.1
com.carrotsearch » hppc 0.5.2 0.9.0
org.apache.lucene » lucene-codecs 4.9.1 9.9.1
org.apache.lucene » lucene-queries 4.9.1 9.9.1
org.apache.lucene » lucene-queryparser 4.9.1 9.9.1
org.apache.lucene » lucene-misc 4.9.1 9.9.1
org.apache.lucene » lucene-join 4.9.1 9.9.1
org.apache.lucene » lucene-memory 4.9.1 9.9.1
org.gagravarr » vorbis-java-core 0.1 0.8
org.ccil.cowan.tagsoup » tagsoup 1.2.1 NA
org.gagravarr » vorbis-java-tika 0.1 0.8
com.google.protobuf » protobuf-java 2.5.0 3.25.3
commons-configuration » commons-configuration 1.6 1.10
org.apache.commons » commons-compress 1.7 1.21
org.tukaani » xz 1.4 1.9
org.apache.zookeeper » zookeeper 3.4.6 3.6.3
org.apache.pdfbox » pdfbox 1.8.4 3.0.0-alpha2
com.spatial4j » spatial4j 0.4.1 0.5
org.apache.james » apache-mime4j-core 0.7.2 0.8.4
org.apache.james » apache-mime4j-dom 0.7.2 0.8.4
com.adobe.xmp » xmpcore 5.1.2 6.1.11
commons-fileupload » commons-fileupload 1.2.1 1.4
org.apache.pdfbox » jempbox 1.8.4 1.8.16
org.apache.pdfbox » fontbox 1.8.4 3.0.0-alpha2
com.ibm.icu » icu4j 53.1 73.1
com.googlecode.concurrentlinkedhashmap » concurrentlinkedhashmap-lru 1.2 1.4.2
org.slf4j » jul-to-slf4j 1.7.6 2.0.12
org.apache.poi » poi 3.10.1 5.0.0
org.apache.poi » poi-ooxml 3.10.1 5.0.0
org.apache.poi » poi-scratchpad 3.10.1 5.0.0
org.apache.poi » poi-ooxml-schemas 3.10.1 4.1.2
org.eclipse.jetty » jetty-jmx 8.1.10.v20130312 9.4.44.v20210927
org.eclipse.jetty » jetty-util 8.1.10.v20130312 9.4.44.v20210927
org.eclipse.jetty » jetty-http 8.1.10.v20130312 10.0.6
org.eclipse.jetty » jetty-io 8.1.10.v20130312 9.4.44.v20210927
org.eclipse.jetty » jetty-servlet 8.1.10.v20130312 10.0.12
org.eclipse.jetty » jetty-security 8.1.10.v20130312 9.4.44.v20210927
org.eclipse.jetty » jetty-continuation 8.1.10.v20130312 9.4.44.v20210927
org.eclipse.jetty » jetty-server 8.1.10.v20130312 9.4.44.v20210927
org.codehaus.woodstox » wstx-asl 3.2.7 4.0.6
commons-codec » commons-codec 1.9 1.15
de.l3s.boilerpipe » boilerpipe 1.1.0 NA
org.bouncycastle » bcmail-jdk15 1.45 1.46
org.bouncycastle » bcprov-jdk15 1.45 1.46
org.restlet.jee » org.restlet 2.1.1 NA
org.restlet.jee » org.restlet.ext.servlet 2.1.1 NA