LOD-a-lot democratizes access to the Linked Open Data (LOD) Cloud by serving more than 28 billion unique triples from 650K datasets from a single self-indexed file. This corpus can be queried online with a sustainable Linked Data Fragments interface, or it can be downloaded and consumed locally: LOD-a-lot is easy to deploy and only requires limited resources (524 GB of disk space and 15.7 GB of RAM), enabling web-scale repeatable experimentation and research from a high-end laptop.
Wappalyzer is a cross-platform utility that uncovers the technologies used on websites. It detects content management systems, ecommerce platforms, web frameworks, server software, analytics tools and many more.
A. Spitz, J. Strötgen, and M. Gertz. Companion Proceedings of the The Web Conference 2018, page 1731--1736. Republic and Canton of Geneva, Switzerland, International World Wide Web Conferences Steering Committee, (2018)
J. Rennie, and A. McCallum. Proceedings of the Sixteenth International Conference on Machine Learning, page 335--343. San Francisco, CA, USA, Morgan Kaufmann Publishers Inc., (1999)
P. Singer, D. Helic, A. Hotho, and M. Strohmaier. Proceedings of the 24th International Conference on World Wide Web, page 1003--1013. Republic and Canton of Geneva, Switzerland, International World Wide Web Conferences Steering Committee, (2015)