In mathematics, the Wasserstein or Kantorovich–Rubinstein metric or distance is a distance function defined between probability distributions on a given metric space M.
Intuitively, if each distribution is viewed as a unit amount of "dirt" piled on M, the metric is the minimum "cost" of turning one pile into the other, which is assumed to be the amount of dirt that needs to be moved times the mean distance it has to be moved. Because of this analogy, the metric is known in computer science as the earth mover's distance.
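In one dimension, the Wasserstein-1 distance between two empirical distributions reduces to the area between their cumulative distribution functions, and SciPy computes it directly. A minimal sketch (`wasserstein_distance` treats its arguments as equally weighted samples):

```python
from scipy.stats import wasserstein_distance

# point masses at {0, 1} versus {1, 2}: moving each unit of "dirt"
# a distance of 1 gives a total cost of 1.0
d = wasserstein_distance([0.0, 1.0], [1.0, 2.0])
print(d)  # 1.0
```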
We introduce Vicuna-13B, an open-source chatbot trained by fine-tuning LLaMA on user-shared conversations collected from ShareGPT. Preliminary evaluation using GPT-4 as a judge shows Vicuna-13B achieves more than 90%* quality of OpenAI ChatGPT and Google Bard while outperforming other models like LLaMA and Stanford Alpaca in more than 90%* of cases. The cost of training Vicuna-13B is around $300. The code and weights, along with an online demo, are publicly available for non-commercial use.
Kedro versioned datasets can be mixed with incremental and partitioned datasets. (Unsure what Kedro is? Check out this post.) This was a question presented to
TLDR — Extractive question answering is an important task for providing a good user experience in many applications. The popular Retriever-Reader framework for QA using BERT can be difficult to scale…
Build document-based question-answering systems using LangChain, Pinecone, LLMs like GPT-4, and semantic search for precise, context-aware AI solutions.
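The retrieval step behind such systems can be illustrated without any of those services: embed documents and the query into vectors, then rank by cosine similarity. A toy sketch using bag-of-words vectors in place of learned embeddings (a real system would use an embedding model and a vector store such as Pinecone):

```python
import re
import numpy as np

docs = [
    "Pinecone is a vector database for semantic search.",
    "LangChain chains LLM calls and tools together.",
    "GPT-4 is a large language model from OpenAI.",
]

def tokenize(text):
    return re.findall(r"[a-z0-9]+", text.lower())

vocab = sorted({w for d in docs for w in tokenize(d)})

def embed(text):
    # toy bag-of-words vector, normalized to unit length
    words = set(tokenize(text))
    v = np.array([1.0 if w in words else 0.0 for w in vocab])
    n = np.linalg.norm(v)
    return v / n if n else v

doc_vecs = np.stack([embed(d) for d in docs])

def retrieve(query, k=1):
    # cosine similarity is a dot product of unit vectors
    sims = doc_vecs @ embed(query)
    return [docs[i] for i in np.argsort(sims)[::-1][:k]]
```

The top-ranked document is then passed to the LLM as context alongside the question, which is what keeps the answer grounded in your own corpus.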
In natural language understanding (NLU) tasks, there is a hierarchy of lenses through which we can extract meaning — from words to sentences to paragraphs to documents. At the document level, one of the most useful ways to understand text is by analyzing its topics. The process of learning, recognizing, and extracting these topics across a collection of documents is called topic modeling.
In this post, we will explore topic modeling through 4 of the most popular techniques today: LSA, pLSA, LDA, and the newer, deep learning-based lda2vec.
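For a concrete taste of one of these techniques, here is a minimal LDA run with scikit-learn on a toy corpus (the corpus and parameters are illustrative only):

```python
from sklearn.decomposition import LatentDirichletAllocation
from sklearn.feature_extraction.text import CountVectorizer

docs = [
    "the cat sat on the mat while the dog slept",
    "dogs and cats are popular household pets",
    "the stock market fell sharply in early trading",
    "investors sold shares as the market dropped",
]

# LDA operates on raw term counts, not TF-IDF
X = CountVectorizer(stop_words="english").fit_transform(docs)
lda = LatentDirichletAllocation(n_components=2, random_state=0).fit(X)

# each row is a document's mixture over the 2 topics (rows sum to 1)
doc_topics = lda.transform(X)
```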
The pulearn Python package provides a collection of scikit-learn wrappers for several positive-unlabeled learning (PU-learning) methods.
Features
Scikit-learn compliant wrappers to prominent PU-learning methods.
Fully tested on Linux, macOS and Windows systems.
Compatible with Python 3.5+.
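The idea behind one of the wrapped methods, the Elkan–Noto estimator, can be sketched with plain scikit-learn (this illustrates the technique on synthetic data; it is not pulearn's actual API):

```python
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(0)
# synthetic data: positives cluster around +2, negatives around -2
X_pos = rng.normal(2.0, 1.0, size=(200, 2))
X_neg = rng.normal(-2.0, 1.0, size=(200, 2))
# only half the positives are labeled; the rest join the unlabeled pool
X_labeled = X_pos[:100]
X_unlabeled = np.vstack([X_pos[100:], X_neg])

# Step 1: train a classifier g to separate labeled from unlabeled
X = np.vstack([X_labeled, X_unlabeled])
s = np.concatenate([np.ones(len(X_labeled)), np.zeros(len(X_unlabeled))])
X_tr, X_hold, s_tr, s_hold = train_test_split(
    X, s, test_size=0.2, stratify=s, random_state=0)
g = LogisticRegression().fit(X_tr, s_tr)

# Step 2: c = P(labeled | positive), estimated as the mean score
# of g on held-out labeled positives
c = g.predict_proba(X_hold[s_hold == 1])[:, 1].mean()

def predict_pos_proba(Xq):
    # Elkan-Noto correction: p(y=1 | x) = g(x) / c
    return np.clip(g.predict_proba(Xq)[:, 1] / c, 0.0, 1.0)

p_far_pos = predict_pos_proba(np.array([[3.0, 3.0]]))[0]
p_far_neg = predict_pos_proba(np.array([[-3.0, -3.0]]))[0]
```

After the correction, a point deep in the positive cluster scores near 1 and a point deep in the negative cluster scores near 0, even though the model never saw a labeled negative.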
In this article, I am going to show you how to choose the number of principal components when using principal component analysis (PCA) for dimensionality reduction.
In the first section, I am going to give you a short answer for those of you who are in a hurry and want to get something working. Later, I am going to provide a more extended explanation for those of you who are interested in understanding PCA.
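One widely used rule of thumb (a sketch; whether it matches this article's own short answer is an assumption) is to keep the smallest number of components that explains a fixed fraction of the variance. scikit-learn's PCA does this directly when n_components is a float:

```python
from sklearn.datasets import load_digits
from sklearn.decomposition import PCA

X = load_digits().data  # 1797 images, 64 features each

# a float asks PCA for the smallest number of components whose
# cumulative explained variance reaches that fraction
pca = PCA(n_components=0.95)
X_reduced = pca.fit_transform(X)
```

`pca.n_components_` then reports how many components were actually kept.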
Facebook Research open sourced a great project recently – fastText, a fast (no surprise) and effective method to learn word representations and perform text classification. I was curious about comparing these embeddings to other commonly used embeddings, so word2vec seemed like the obvious choice, especially considering fastText embeddings are an extension of word2vec.
The main aim of SenticNet is to make the conceptual and affective information conveyed by natural language (meant for human consumption) more easily accessible to machines.
You want to discern how many clusters there are (or, if you prefer, how many Gaussian components generated the data), and you don't have any information about the "ground truth". A realistic case, where the data do not have the nicety of behaving as well as simulated ones.
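One standard way to pick the number of components without ground truth is to fit mixtures of several sizes and keep the one with the lowest information criterion, such as BIC. A minimal sketch with scikit-learn, shown on simulated blobs purely for reproducibility:

```python
import numpy as np
from sklearn.mixture import GaussianMixture

rng = np.random.default_rng(0)
# three well-separated 2-D Gaussian blobs
X = np.vstack([rng.normal(loc, 0.5, size=(100, 2))
               for loc in (-5.0, 0.0, 5.0)])

# fit mixtures of increasing size and keep the lowest BIC
bics = {k: GaussianMixture(n_components=k, random_state=0).fit(X).bic(X)
        for k in range(1, 7)}
best_k = min(bics, key=bics.get)
```

BIC penalizes extra components, so it tends to stop growing the model once added Gaussians no longer improve the fit enough to justify their parameters.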
Definition of NLP coherence scores, in particular intrinsic UMass measure and PMI.
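The UMass measure is compact enough to state directly: for topic words ordered by frequency, it sums log((D(w_i, w_j) + 1) / D(w_j)) over pairs j < i, where D counts the documents containing all the given words. A minimal sketch:

```python
import math

def umass_coherence(topic_words, documents):
    """UMass coherence: sum over word pairs (j < i) of
    log((D(w_i, w_j) + 1) / D(w_j)), where D counts documents
    containing all the given words."""
    doc_sets = [set(d) for d in documents]

    def D(*words):
        return sum(all(w in s for w in words) for s in doc_sets)

    score = 0.0
    for i in range(1, len(topic_words)):
        for j in range(i):
            score += math.log((D(topic_words[i], topic_words[j]) + 1)
                              / D(topic_words[j]))
    return score

docs = [["cat", "dog", "pet"], ["cat", "dog"], ["dog", "bird"]]
coherent = umass_coherence(["dog", "cat"], docs)     # frequent co-occurrence
incoherent = umass_coherence(["dog", "bird"], docs)  # rare co-occurrence
```

Higher (less negative) scores mean the topic's words tend to appear in the same documents, which is the intrinsic signal UMass uses in place of human judgment.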
The fact that human judgment does not correlate with perplexity (or the likelihood of unseen documents) motivates further work on modeling human judgment directly. This is itself a hard task, as human judgment is not clearly defined; for example, two experts can disagree on the usefulness of a topic.
One can classify the methods addressing this problem into two categories: intrinsic methods, which use no external source or task beyond the dataset, and extrinsic methods, which use the discovered topics for external tasks, such as information retrieval [Wei06], or use external statistics to evaluate topics.