tag :: lucene tools

Lesezeichen (verstecken)7
Anzeige
alles
nur Lesezeichen
Lesezeichen pro Seite
5
10
20
50
100
sortieren nach
hinzugefügt am
Titel
RSS
BibTeX
XML

2katta - distributed lucene
Katta is a scalable, failure tolerant, distributed, data storage for real time access. Katta serves large, replicated, indices as shards to serve high loads and very large data sets. These indices can be of different type. Currently implementations are available for Lucene and Hadoop mapfiles. * Makes serving large or high load indices easy * Serves very large Lucene or Hadoop Mapfile indices as index shards on many servers * Replicate shards on different servers for performance and fault-tolerance * Supports pluggable network topologies * Master fail-over * Fast, lightweight, easy to integrate * Plays well with Hadoop clusters * Apache Version 2 License
vor 15 Jahren von @gresch
alle anzeigen
cloud
data
framework
hadoop
indices
java
lucene
mapreduce
search
searchengine
searching
shards
software
tools
clouddataframeworkhadoopindicesjavalucenemapreducesearchsearchenginesearchingshardssoftwaretools
(0)
KopierenLöschen
- Community-Eintrag
- Versionsverlauf dieses Eintrags
1Hadoop Cluster Setup
http://hadoop.apache.org/common/docs/r0.17.1/cluster_setup.html
vor 15 Jahren von @beate
alle anzeigen
crawl
hadoop
lucene
search
tools
tutorial
crawlhadooplucenesearchtoolstutorial
(0)
KopierenLöschen
- Community-Eintrag
- Versionsverlauf dieses Eintrags
3Introduction to Nutch, Part 1: Crawling | Java.net
-dumppageurl
vor 15 Jahren von @beate
alle anzeigen
lucene
nutch
search
tools
tutorial
lucenenutchsearchtoolstutorial
(0)
KopierenLöschen
- Community-Eintrag
- Versionsverlauf dieses Eintrags
1Nutch version 0.8.x tutorial
<property> <name>http.agent.name</name> <value></value> <description>HTTP 'User-Agent' request header. MUST NOT be empty - please set this to a single word uniquely related to your organization. NOTE: You should also check other related properties: http.robots.agents http.agent.description http.agent.url http.agent.email http.agent.version and set their values appropriately. </description> </property> <property> <name>http.agent.description</name> <value></value> <description>Further description of our bot- this text is used in the User-Agent header. It appears in parenthesis after the agent name. </description> </property> <property> <name>http.agent.url</name> <value></value> <description>A URL to advertise in the User-Agent header. This will appear in parenthesis after the agent name. Custom dictates that this should be a URL of a page explaining the purpose and behavior of this crawler. </description> </property> <property> <name>http.agent.email</name> <value></value> <description>An email address to advertise in the HTTP 'From' request header and User-Agent header. A good practice is to mangle this address (e.g. 'info at example dot com') to avoid spamming. </description> </property>
vor 15 Jahren von @beate
alle anzeigen
crawl
lucene
nutch
search
tools
tutorial
crawllucenenutchsearchtoolstutorial
(0)
KopierenLöschen
- Community-Eintrag
- Versionsverlauf dieses Eintrags
2Salmon Run: Nutch: Getting my Feet Wet
http://sujitpal.blogspot.com/2009/07/nutch-getting-my-feet-wet.html
vor 15 Jahren von @beate
alle anzeigen
crawl
lucene
nutch
search
tools
crawllucenenutchsearchtools
(0)
KopierenLöschen
- Community-Eintrag
- Versionsverlauf dieses Eintrags

⟨⟨
⟨
1
2
⟩
⟩⟩

Publikationen (verstecken)
Anzeige
alles
nur Publikationen
Publikationen pro Seite
5
10
20
50
100
sortieren nach
hinzugefügt am
Titel
Autor
Erscheinungsdatum
Eintragstyp
Hilfe für erweiterte Sortierung...
RSS
BibTeX
RDF
mehr...

Keine Treffer.

⟨⟨
⟨
⟩
⟩⟩

BibSonomy

Lesezeichen (verstecken)7
Anzeige
alles
nur Lesezeichen
Lesezeichen pro Seite
5
10
20
50
100
sortieren nach
hinzugefügt am
Titel
RSS
BibTeX
XML

2katta - distributed lucene

1Hadoop Cluster Setup

3Introduction to Nutch, Part 1: Crawling | Java.net

1Nutch version 0.8.x tutorial

2Salmon Run: Nutch: Getting my Feet Wet

Publikationen (verstecken)
Anzeige
alles
nur Publikationen
Publikationen pro Seite
5
10
20
50
100
sortieren nach
hinzugefügt am
Titel
Autor
Erscheinungsdatum
Eintragstyp
Hilfe für erweiterte Sortierung...
RSS
BibTeX
RDF
mehr...

Stöbern

Verwandte Tags

BibSonomy

Lesezeichen (verstecken)7 Anzeigeallesnur LesezeichenLesezeichen pro Seite5102050100 sortieren nachhinzugefügt amTitel RSSBibTeXXML

2katta - distributed lucene

1Hadoop Cluster Setup

3Introduction to Nutch, Part 1: Crawling | Java.net

1Nutch version 0.8.x tutorial

2Salmon Run: Nutch: Getting my Feet Wet

Publikationen (verstecken) Anzeigeallesnur PublikationenPublikationen pro Seite5102050100 sortieren nachhinzugefügt amTitelAutorErscheinungsdatumEintragstypHilfe für erweiterte Sortierung... RSSBibTeXRDFmehr...

Stöbern

Verwandte Tags

Lesezeichen (verstecken)7
Anzeige
alles
nur Lesezeichen
Lesezeichen pro Seite
5
10
20
50
100
sortieren nach
hinzugefügt am
Titel
RSS
BibTeX
XML

Publikationen (verstecken)
Anzeige
alles
nur Publikationen
Publikationen pro Seite
5
10
20
50
100
sortieren nach
hinzugefügt am
Titel
Autor
Erscheinungsdatum
Eintragstyp
Hilfe für erweiterte Sortierung...
RSS
BibTeX
RDF
mehr...