BibSonomy

The blue social bookmark and publication sharing system.

( en | de | ru )

 

group
  • tag
  • user
  • group
  • author
  • concept
  • BibTeX key
  • search
unknowndata
  • sign in
  • register
  • groups
  • genealogy
  • popular 
    • posts
    • tags
    • authors
    • concepts
    • discussions
  • sign in
  • register

Login

Log in with your username.

@

I've lost my password.


Log in with your OpenID-Provider.

  • Other OpenID-Provider
  1. group
  2. unknowndata
  3. bigdata web archive

Publication title

bookmarks  (hide)2
  • display
  • all
  • bookmarks only
  • bookmarks per page
  • 5
  • 10
  • 20
  • 50
  • 100
  • sort by
  • added at
  • title
  • RSS
  • BibTeX
  • XML

  •  

     
    1ia-web-commons/src/main/java/org/archive/hadoop/ResourceRecordReader.java at master · internetarchive/ia-web-commons
     

    https://github.com/internetarchive/ia-web-commons/blob/master/src/main/java/org/archive/hadoop/ResourceRecordReader.java
    12 years ago by @jaeschke
    show all tags
    • bigdata
    • web
    • archive
    • crawling
    • hadoop
    • analysis
    • warc
    • programming
     
      bigdatawebarchivecrawlinghadoopanalysiswarcprogramming
      copydelete
      • community post
      • history of this post
       
       
    •  

       
      2Web Archive Transformation (WAT) Specification, Utilities, and Usage Overview - Internet Research - IA Webteam Confluence
       

      https://webarchive.jira.com/wiki/display/Iresearch/Web+Archive+Transformation+(WAT)+Specification,+Utilities,+and+Usage+Overview
      12 years ago by @jaeschke
      show all tags
      • bigdata
      • web
      • wat
      • archive
      • crawling
      • hadoop
      • analysis
      • warc
       
        bigdatawebwatarchivecrawlinghadoopanalysiswarc
        copydelete
        • community post
        • history of this post
         
         
      • ⟨⟨
      • ⟨
      • 1
      • ⟩
      • ⟩⟩

      publications  (hide)
      • display
      • all
      • publications only
      • publications per page
      • 5
      • 10
      • 20
      • 50
      • 100
      • sort by
      • added at
      • title
      • author
      • publication date
      • entry type
      • help for advanced sorting...
      • RSS
      • BibTeX
      • RDF
      • more...

        No matching posts.
      • ⟨⟨
      • ⟨
      • ⟩
      • ⟩⟩

      unknowndata

      @unknowndata

      Unknown Data

      CVexplore
      join

      browse

      • bigdata web archive as tag from all users
      • web as concept from all users
      • bigdata web archive as concept from all users

      related tags

      • + | crawling
      • + | hadoop
      • + | analysis
      • + | warc
      • + | wat
      • + | programming

      tags

      • programming
      • social
      • web
      • myown
      • deeplearning
      • visualization
      • learning
      • recommender
      • survey
      • network
      • reference
      • analysis
      • ddm
      • search
      • folksonomy
      • deep
      • tagging
      • map
      • text
      • mining
      • bibsonomy
      • graph
      • dh
      • science
      • citation
      • collaborative
      • library
      • daa_botw
      • java
      • howto
      • tutorial
      • ranking
      • ai
      • google
      • mk5.4
      • bibtex
      • conference
      • digital
      • statistics
      • book
      • machine
      • semantic
      • archive
      • data
      • twitter
      • latex
      • linux
      • pagerank
      What is BibSonomy?
      Getting Started
      Browser Buttons
      Help
      Developer
      Overview
      API Documentation
      Contact & Privacy
      Contact
      Privacy & Terms of Use
      Cookies
      Report Issues
      BibSonomy Wiki
      Integration
      PUMA
      TYPO3 Extension
      WordPress Plugin
      Java REST Client
      Supported Sites
      more
      About BibSonomy
      Team
      Blog
      Mailing List
      Social Media
       Follow us on Twitter

      BibSonomy is offered by the Data Science Chair of the University of Würzburg, the Information Processing and Analytics Group of the Humboldt-Unversität zu Berlin, the KDE Group of the University of Kassel, and the L3S Research Center.