This article discusses the role of government regulation in AI ethics, emphasizing the need for a combined community and top-down approach for the development of AI systems.
This GitHub repository, titled 'GPT-Researcher' by Assafelovic, contains resources and information related to AI and machine learning, focusing on GPT models.
A captivating visualization by Financial Times that provides an in-depth understanding of how transformers work in the realm of Generative AI. It offers insights into the mechanics and intricacies of transformer architectures, showcasing the beauty of today's Large Language Models (LLMs).
An article discussing the importance of ranking models in search engines and how Weaviate, an open-source vector database, has introduced a new feature allowing users to define their own ranking models.
The documentation details the SubQuestionQueryEngine in the LlamaIndex library. This query engine breaks down complex queries into multiple sub-questions, which are then directed to their target query engine for execution. The responses from the sub-questions are synthesized to produce the final response.
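The decompose, route, and synthesize flow described above can be sketched in plain Python. This is a minimal illustration of the pattern only, not the LlamaIndex API: `SubQuestion`, `answer_with_sub_questions`, and the toy decomposer, engines, and synthesizer are all hypothetical names, and in a real setup an LLM would typically generate the sub-questions and the final synthesis.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass
class SubQuestion:
    text: str       # the sub-question to ask
    tool_name: str  # name of the engine that should answer it


def answer_with_sub_questions(
    query: str,
    decompose: Callable[[str], List[SubQuestion]],
    engines: Dict[str, Callable[[str], str]],
    synthesize: Callable[[str, List[str]], str],
) -> str:
    """Split a query into sub-questions, route each to its engine, merge the answers."""
    partial_answers = []
    for sub_question in decompose(query):
        engine = engines[sub_question.tool_name]  # route to the target engine
        partial_answers.append(f"{sub_question.text} -> {engine(sub_question.text)}")
    return synthesize(query, partial_answers)


# Hypothetical stand-ins: fixed sub-questions and canned answers.
def toy_decompose(query: str) -> List[SubQuestion]:
    return [
        SubQuestion("What is the revenue of A?", "company_a"),
        SubQuestion("What is the revenue of B?", "company_b"),
    ]


toy_engines = {
    "company_a": lambda question: "A reported 10 units.",
    "company_b": lambda question: "B reported 7 units.",
}


def toy_synthesize(query: str, partials: List[str]) -> str:
    # A real synthesizer would summarize; this one only concatenates.
    return f"Q: {query}\n" + "\n".join(partials)


result = answer_with_sub_questions(
    "Compare the revenue of A and B.", toy_decompose, toy_engines, toy_synthesize
)
print(result)
```

Passing the decomposer, engines, and synthesizer in as callables keeps the control flow visible and lets each piece be swapped for an LLM-backed version later.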
The paper discusses the capabilities of large pre-trained language models and their limitations in accessing and manipulating knowledge. The authors introduce retrieval-augmented generation (RAG) models that combine pre-trained parametric and non-parametric memory for language generation. The study explores the effectiveness of RAG models in various NLP tasks and compares them with other architectures.
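The idea of pairing a generator with an external, non-parametric memory can be illustrated with a toy retrieve-then-generate loop. This is a sketch under simplifying assumptions, not the paper's method: it swaps learned dense retrieval for plain word overlap, and `retrieve`, `build_prompt`, `rag_answer`, and `echo_generate` are hypothetical names, with the generator being a stand-in callable rather than a real model.

```python
import re
from typing import Callable, List, Set


def tokenize(text: str) -> Set[str]:
    """Lowercase the text and return its set of alphanumeric tokens."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def retrieve(query: str, passages: List[str], k: int = 2) -> List[str]:
    """Return the k passages that share the most tokens with the query."""
    query_tokens = tokenize(query)
    ranked = sorted(
        passages, key=lambda p: len(query_tokens & tokenize(p)), reverse=True
    )
    return ranked[:k]


def build_prompt(query: str, context: List[str]) -> str:
    """Place the retrieved passages in front of the question."""
    joined = "\n".join(f"- {passage}" for passage in context)
    return f"Context:\n{joined}\n\nQuestion: {query}\nAnswer:"


def rag_answer(
    query: str, passages: List[str], generate: Callable[[str], str], k: int = 2
) -> str:
    """Retrieve supporting passages, then hand the augmented prompt to the generator."""
    return generate(build_prompt(query, retrieve(query, passages, k)))


PASSAGES = [
    "The Eiffel Tower is located in Paris.",
    "Bananas are rich in potassium.",
    "Paris is the capital of France.",
]


def echo_generate(prompt: str) -> str:
    # Assumption: stands in for a language model; it only returns the prompt.
    return prompt


top = retrieve("Where is the Eiffel Tower located?", PASSAGES, k=1)
prompt = rag_answer("Where is the Eiffel Tower located?", PASSAGES, echo_generate)
print(prompt)
```

Because the passages live outside the generator, they can be updated without retraining, which is the practical appeal of the non-parametric memory.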
Without RAG, an LLM is only as smart as the data it was trained on. In other words, an LLM can only generate text based on what it has “seen” and cannot pull in new information after its training cut-off. Sam Altman stated “the right way to think of the models that we create is a reasoning engine, not a fact database.” Essentially, we should use the language model for its reasoning ability, not for the knowledge it has.
Multi-Column Markdown is a document formatting plugin for the ObsidianMD note-taking application. It was created by Cameron Robinson to fill a gap in Obsidian's functionality, and it has been released as an official plugin for Obsidian.
Miro is a collaborative online whiteboard platform designed for remote and distributed teams. It is also used in the Klima-Schülerlabor for digital worksheets.
J. Pfister, T. Völker, A. Vlasjuk, and A. Hotho. Proceedings of the 1st Joint Workshop on Large Language Models and Structure Modeling (XLLM 2025), pp. 115--128. Vienna, Austria, Association for Computational Linguistics, (August 2025)
J. Pfister, J. Wunderle, and A. Hotho. Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp. 2227--2246. Vienna, Austria, Association for Computational Linguistics, (July 2025)
K. Kobs, T. Koopmann, A. Zehe, D. Fernes, P. Krop, and A. Hotho. Findings of the Association for Computational Linguistics: EMNLP 2020, pp. 878--883. Online, Association for Computational Linguistics, (November 2020)
Y. Yang, C. Huang, L. Xia, and C. Li. Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 1434--1443. (2022)
K. Kobs, J. Pfister, and A. Hotho. Proceedings of the 18th International Workshop on Semantic Evaluation (SemEval-2024), pp. 1529--1536. Mexico City, Mexico, Association for Computational Linguistics, (June 2024)
J. Wunderle, J. Schubert, A. Cacciatore, A. Zehe, J. Pfister, and A. Hotho. Proceedings of the 18th International Workshop on Semantic Evaluation (SemEval-2024), pp. 602--612. Mexico City, Mexico, Association for Computational Linguistics, (June 2024)