Sqoop is a tool designed to import data from relational databases into Hadoop. Sqoop uses JDBC to connect to a database, examines each table's schema, and automatically generates the classes needed to import the data into the Hadoop Distributed File System (HDFS). Sqoop then creates and launches a MapReduce job to read tables from the database via DBInputFormat, the JDBC-based InputFormat. Tables are read into a set of files in HDFS. Sqoop supports both SequenceFile and text-based targets, and includes performance enhancements for loading data from MySQL.
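Sqoop drives this import from the command line, but the machinery it generates is ordinary Hadoop code. The sketch below is not Sqoop's generated output; it is a minimal, hand-written illustration of the same mechanism, assuming a recent Hadoop 2.x release, a MySQL JDBC driver on the classpath, and a hypothetical `widgets(id, name)` table: a DBWritable record class like the ones Sqoop generates, plus a map-only job that uses DBInputFormat to copy rows into tab-separated text files in HDFS.

```java
import java.io.DataInput;
import java.io.DataOutput;
import java.io.IOException;
import java.sql.PreparedStatement;
import java.sql.ResultSet;
import java.sql.SQLException;

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.io.LongWritable;
import org.apache.hadoop.io.NullWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.io.Writable;
import org.apache.hadoop.mapreduce.Job;
import org.apache.hadoop.mapreduce.Mapper;
import org.apache.hadoop.mapreduce.lib.db.DBConfiguration;
import org.apache.hadoop.mapreduce.lib.db.DBInputFormat;
import org.apache.hadoop.mapreduce.lib.db.DBWritable;
import org.apache.hadoop.mapreduce.lib.output.FileOutputFormat;
import org.apache.hadoop.mapreduce.lib.output.TextOutputFormat;

public class WidgetImport {

  // One row of the hypothetical "widgets" table. Sqoop generates an
  // equivalent class automatically from the table's schema.
  public static class WidgetRecord implements Writable, DBWritable {
    private int id;
    private String name;

    public void readFields(ResultSet rs) throws SQLException {    // from JDBC
      id = rs.getInt("id");
      name = rs.getString("name");
    }
    public void write(PreparedStatement st) throws SQLException {  // to JDBC
      st.setInt(1, id);
      st.setString(2, name);
    }
    public void readFields(DataInput in) throws IOException {      // from HDFS
      id = in.readInt();
      name = Text.readString(in);
    }
    public void write(DataOutput out) throws IOException {         // to HDFS
      out.writeInt(id);
      Text.writeString(out, name);
    }
    public String toString() {
      return id + "\t" + name;
    }
  }

  // Map-only job: each database row becomes one tab-separated line in HDFS.
  public static class ImportMapper
      extends Mapper<LongWritable, WidgetRecord, Text, NullWritable> {
    protected void map(LongWritable key, WidgetRecord value, Context context)
        throws IOException, InterruptedException {
      context.write(new Text(value.toString()), NullWritable.get());
    }
  }

  public static void main(String[] args) throws Exception {
    Configuration conf = new Configuration();
    // Hypothetical MySQL connection details.
    DBConfiguration.configureDB(conf, "com.mysql.jdbc.Driver",
        "jdbc:mysql://dbhost/shop", "user", "password");

    Job job = Job.getInstance(conf, "widget import");
    job.setJarByClass(WidgetImport.class);
    job.setMapperClass(ImportMapper.class);
    job.setNumReduceTasks(0);

    // DBInputFormat issues a SELECT over the named columns and hands each
    // row to the mapper as a WidgetRecord.
    job.setInputFormatClass(DBInputFormat.class);
    DBInputFormat.setInput(job, WidgetRecord.class,
        "widgets", null, "id", "id", "name");

    job.setOutputFormatClass(TextOutputFormat.class);
    job.setOutputKeyClass(Text.class);
    job.setOutputValueClass(NullWritable.class);
    FileOutputFormat.setOutputPath(job, new Path("widgets"));

    System.exit(job.waitForCompletion(true) ? 0 : 1);
  }
}
```

Sqoop's generated code goes further than this sketch: it maps every column type, can write SequenceFiles instead of text, and uses the MySQL-specific fast path mentioned above.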
Katta is a scalable, fault-tolerant, distributed data store for real-time access.
Katta serves large, replicated indices as shards in order to handle high loads and very large data sets. These indices can be of different types; implementations are currently available for Lucene indices and Hadoop MapFiles.
* Makes serving large or high load indices easy
* Serves very large Lucene or Hadoop MapFile indices as index shards on many servers
* Replicates shards across different servers for performance and fault tolerance
* Supports pluggable network topologies
* Master fail-over
* Fast, lightweight, easy to integrate
* Plays well with Hadoop clusters
* Apache License, Version 2.0
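The Hadoop MapFiles mentioned above are sorted SequenceFiles paired with a small in-memory index, which is what makes them usable as randomly accessible shards. The sketch below is not Katta's client API; it is a minimal illustration, assuming a hypothetical shard directory and Text keys and values, of the kind of keyed lookup a MapFile shard supports.

```java
import java.io.IOException;

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.FileSystem;
import org.apache.hadoop.io.MapFile;
import org.apache.hadoop.io.Text;

public class MapFileLookup {
  public static void main(String[] args) throws IOException {
    Configuration conf = new Configuration();
    FileSystem fs = FileSystem.get(conf);

    // Hypothetical shard directory holding the MapFile's data and index files.
    String shardDir = "/katta/shards/shard-0";

    // MapFile.Reader loads the sparse index into memory and seeks within the
    // sorted data file, so a single key can be fetched without a full scan.
    MapFile.Reader reader = new MapFile.Reader(fs, shardDir, conf);
    try {
      Text key = new Text("some-key");   // hypothetical key
      Text value = new Text();
      if (reader.get(key, value) != null) {
        System.out.println(key + " -> " + value);
      } else {
        System.out.println(key + " not found in this shard");
      }
    } finally {
      reader.close();
    }
  }
}
```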