jaeschke > social | BibSonomy

bookmarks (53)

Compromising Twitter's OAuth security system
Twitter recently transitioned to OAuth, but the social network's implementation of the new authentication system has some serious flaws. Ars shows how easy it was to compromise the secret key of Twitter's own official client application for Android.
14 years ago by @jaeschke
tags: twitter, social, oauth, security
Feature of the Week: Password Buddies
Lately, we've been working hard to improve BibSonomy's social features. With the recent release we introduced another unique feature that has not been announced until now. Following the intuition that secrets are always shared among best friends, our idea is to connect you to people who have the same login password for BibSonomy as you. This is an outstanding feature that other social networking sites have lacked up to now; usually, you only get buddies recommended by some black-box algorithm. Our solution is more targeted towards the idea that great minds think alike, and hence choose the same password. So if you have the same password as other users in BibSonomy, you'll see them in the sidebar in the new "your password buddies" section: just have a look at your personal page to get to know your possible new buddies. Please note that some of your password buddies may have a different password from yours because of possible hash collisions of the MD5 algorithm. Unfortunately we cannot solve this issue because we don't store the plain text password, but we are working on an extension of the MD5 algorithm that produces no collisions. Happy secret sharing! Your BibSonomy team
13 years ago by @jaeschke
tags: social, bibsonomynews, fotw, bibsonomy, blog
Crowdsourcing News, Events, and Resources
http://ir.ischool.utexas.edu/crowd/
12 years ago by @jaeschke
tags: crowdsourcing, cirg, social, computing, collective, human, intelligence
So You Think You Have a Power Law — Well Isn't That Special?
Three-Toed Sloth, June 15, 2007 (posted by crshalizi)

Regular readers who care about such things — I think there are about three of you — will recall that I have long had a thing about just how unsound many of the claims for the presence of power law distributions in real data are, especially those made by theoretical physicists, who, with some honorable exceptions, learn nothing about data analysis. (I certainly didn't.) I have even whined about how I should really be working on a paper about how to do all this right, rather than merely snarking in a weblog. As evidence that the age of wonders is not passed — and, more relevantly, that I have productive collaborators — this paper is now loosed upon the world:

Aaron Clauset, CRS and M. E. J. Newman, "Power-law distributions in empirical data", arxiv:0706.1062, with code available in Matlab and R; forthcoming (2009) in SIAM Review

Abstract: Power-law distributions occur in many situations of scientific interest and have significant consequences for our understanding of natural and man-made phenomena. Unfortunately, the empirical detection and characterization of power laws is made difficult by the large fluctuations that occur in the tail of the distribution. In particular, standard methods such as least-squares fitting are known to produce systematically biased estimates of parameters for power-law distributions and should not be used in most circumstances. Here we describe statistical techniques for making accurate parameter estimates for power-law data, based on maximum likelihood methods and the Kolmogorov-Smirnov statistic. We also show how to tell whether the data follow a power-law distribution at all, defining quantitative measures that indicate when the power law is a reasonable fit to the data and when it is not. We demonstrate these methods by applying them to twenty-four real-world data sets from a range of different disciplines. Each of the data sets has been conjectured previously to follow a power-law distribution. In some cases we find these conjectures to be consistent with the data while in others the power law is ruled out.

The paper is deliberately aimed at physicists, so we assume some things that they know (like some of the mechanisms, e.g. critical fluctuations, which can lead to power laws), and devote extra detail to things they don't but which e.g. statisticians do know (such as how to find the cumulative distribution function of a standard Gaussian). In particular, we refrained from making a big deal about the need for an error-statistical approach to problems like this, but it definitely shaped our thinking.

Aaron has already posted about the paper, but I'll do so myself anyway. Partly this is to help hammer the message home, and partly this is because I am a basically negative and critical man, and this sort of work gives me an excuse to vent my feelings of spite under the pretense of advancing truth (unlike Aaron and Mark, who are basically nice guys and constructive scholars). Here are the take-home points, none of which ought to be news, but which, taken together, would lead to a real change in the literature. (For example, half or more of each issue of Physica A would disappear.)

1. Lots of distributions give you straight-ish lines on a log-log plot. True, a Gaussian or a Poisson won't, but lots of other things will. Don't even begin to talk to me about log-log plots which you claim are "piecewise linear".

2. Abusing linear regression makes the baby Gauss cry. Fitting a line to your log-log plot by least squares is a bad idea. It generally doesn't even give you a probability distribution, and even if your data do follow a power-law distribution, it gives you a bad estimate of the parameters. You cannot use the error estimates your regression software gives you, because those formulas incorporate assumptions which directly contradict the idea that you are seeing samples from a power law. And no, you cannot claim that because the line "explains" (really, describes) a lot of the variance that you must have a power law, because you can get a very high R^2 from other distributions (that test has no "power"). And this is without getting into the additional errors caused by trying to fit a line to binned histograms. It's true that fitting lines on log-log graphs is what Pareto did back in the day when he started this whole power-law business, but "the day" was the 1890s. There's a time and a place for being old school; this isn't it.

3. Use maximum likelihood to estimate the scaling exponent. It's fast! The formula is easy! Best of all, it works! The method of maximum likelihood was invented in 1922, by someone who studied statistical mechanics, no less. The maximum likelihood estimators for the discrete (Zipf/zeta) and continuous (Pareto) power laws were worked out in 1952 and 1957 (respectively). They converge on the correct value of the scaling exponent with probability 1, and they do so efficiently. You can even work out their sampling distribution (it's an inverse gamma) and so get exact confidence intervals. Use the MLEs!

4. Use goodness of fit to estimate where the scaling region begins. Few people pretend that the whole of their data-set follows a power law distribution; usually the claim is about the right or upper tail, the large values over some given threshold. This ought to raise the question of where the tail begins. Usually people find it by squinting at their log-log plot. Mark Handcock and James Jones, in one of the few worthwhile efforts here, suggested using Schwarz's information criterion. This isn't bad, but has trouble with continuous data. Aaron devised an ingenious method which finds the empirically-best scaling region, by optimizing the Kolmogorov-Smirnov goodness-of-fit statistic; it performs slightly better than the information criterion. (Yes, one could imagine more elaborate semi-parametric approaches to this problem. Feel free to go ahead and implement them.)

5. Use a goodness-of-fit test to check goodness of fit. In particular, if you're looking at the goodness of fit of a distribution, use a statistic meant for distributions, not one for regression curves. This means forgetting about R^2, the fraction of variance accounted for by the curve, and using the Kolmogorov-Smirnov statistic, the maximum discrepancy between the empirical distribution and the theoretical one. If you've got the right theoretical distribution, the KS statistic will converge to zero as you get more data (that's the Glivenko-Cantelli theorem). The one hitch in this case is that you can't use the usual tables/formulas for significance levels, because you're estimating the parameters of the power law from the data. (If you really want to see where the problem comes from, see Pollard, starting on p. 99.) This is why God, in Her wisdom and mercy, gave us the bootstrap. If the chance of getting data which fits the estimated distribution as badly as your data fits your power law is, oh, one in a thousand or less, you had better have some other, very compelling reason to think that you're looking at a power law.

6. Use Vuong's test to check alternatives, and be prepared for disappointment. Even if you've estimated the parameters of your power law properly, and the fit is decent, you're not done yet. You also need to see whether other, non-power-law distributions could have produced the data. This is a model selection problem, with the complication that possibly neither the power law nor the alternative you're looking at is exactly right; in that case you'd at least like to know which one is closer to the truth. There is a brilliantly simple solution to this problem (at least for cases like this) which was first devised by Quang Vuong in a 1989 Econometrica paper: use the log-likelihood ratio, normalized by an estimate of the magnitude of the fluctuations in that ratio. Vuong showed that this test statistic asymptotically has a standard Gaussian distribution when the competing models are equally good; otherwise it will almost surely converge on picking out the better model. This is extremely clever and deserves to be much better known. And, unlike things like the fit to a log-log regression line, it actually has the power to discriminate among the alternatives.

If you use sensible, heavy-tailed alternative distributions, like the log-normal or the Weibull (stretched exponential), you will find that it is often very, very hard to rule them out. In the two dozen data sets we looked at, all chosen because people had claimed they followed power laws, the log-normal's fit was almost always competitive with the power law, usually insignificantly better and sometimes substantially better. (To repeat a joke: Gauss is not mocked.)

For about half the data sets, the fit is substantially improved by adding an exponential cut-off to the power law. (I'm too lazy to produce the necessary equations; read the paper.) This means that there is a characteristic scale after all, and that super-mega-enormous, as opposed to merely enormous, events are, indeed, exponentially rare. Strictly speaking, a cut-off power law should always fit the data better than a pure one (just let the cut-off scale go to infinity, if need be), so you need to be a little careful in seeing whether the improvement is real or just noise; but often it's real.

7. Ask yourself whether you really care. Maybe you don't. A lot of the time, we think, all that's genuinely important is that the tail is heavy, and it doesn't really matter whether it decays linearly in the log of the variable (power law) or quadratically (log-normal) or something else. If that's all that matters, then you should really consider doing some kind of non-parametric density estimation (e.g. Markovitch and Krieger's preprint).

Sometimes, though, you do care. Maybe you want to make a claim which depends heavily on just how common hugely large observations are. Or maybe you have a particular model in mind for the data-generating process, and that model predicts some particular distribution for the tail. Then knowing whether it really is a power law, or closer to a power law than (say) a stretched exponential, actually matters to you. In that case, you owe it to yourself to do the data analysis right. You also owe it to yourself to think carefully about whether there are other ways of checking your model. If the only testable prediction it makes is about the shape of the tail, it doesn't sound like a very good model, and it will be intrinsically hard to check it.

Because this is, of course, what everyone ought to do with a computational paper, we've put our code online, so you can check our calculations, or use these methods on your own data, without having to implement them from scratch. I trust that I will no longer have to referee papers where people use GnuPlot to draw lines on log-log graphs, as though that meant something, and that in five to ten years even science journalists and editors of Wired will begin to get the message.
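Points 3 and 5 of the post can be illustrated with a minimal sketch. It assumes continuous data and a threshold `xmin` that is already known (the post's point 4, choosing `xmin`, is not covered). The estimator is the standard continuous (Pareto) maximum-likelihood formula, alpha = 1 + n / sum(ln(x_i / xmin)), and the function names are illustrative only; this is not the code the authors released in Matlab and R.

```python
import math
import random


def fit_alpha(data, xmin):
    """Continuous power-law MLE for the tail x >= xmin (xmin assumed known)."""
    tail = [x for x in data if x >= xmin]
    n = len(tail)
    log_sum = sum(math.log(x / xmin) for x in tail)
    alpha = 1.0 + n / log_sum
    # Leading-order standard error of the estimate.
    stderr = (alpha - 1.0) / math.sqrt(n)
    return alpha, stderr, n


def ks_distance(data, xmin, alpha):
    """Maximum gap between the empirical CDF of the tail and the fitted CDF."""
    tail = sorted(x for x in data if x >= xmin)
    n = len(tail)
    worst = 0.0
    for i, x in enumerate(tail):
        # Fitted CDF of a continuous power law: F(x) = 1 - (x / xmin)^(1 - alpha)
        model = 1.0 - (x / xmin) ** (1.0 - alpha)
        # Compare against the empirical CDF just before and just after x.
        worst = max(worst, abs(model - i / n), abs(model - (i + 1) / n))
    return worst


def sample_power_law(n, xmin, alpha, rng):
    """Draw n synthetic values by inverse-transform sampling (for testing only)."""
    return [xmin * (1.0 - rng.random()) ** (-1.0 / (alpha - 1.0)) for _ in range(n)]


if __name__ == "__main__":
    rng = random.Random(0)
    xs = sample_power_law(20000, xmin=1.0, alpha=2.5, rng=rng)
    alpha_hat, se, n = fit_alpha(xs, xmin=1.0)
    print(f"alpha_hat={alpha_hat:.3f} +/- {se:.3f} (n={n})")
    print(f"KS distance={ks_distance(xs, 1.0, alpha_hat):.4f}")
```

As the post stresses, the KS distance computed this way cannot be read against the usual significance tables, because `alpha` was estimated from the same data; a bootstrap over synthetic samples would be needed to turn it into a p-value.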
13 years ago by @jaeschke
tags: powerlaw, social, sna, analysis, network
Collecting and Visualizing Twitter Network Data with NodeXl and Gephi - Social Dynamics
http://social-dynamics.org/twitter-network-data/
9 years ago by @jaeschke
tags: twitter, social, visualization, nodexl, data, gephi, analysis, network
Home | The Social Computer
http://socialcomputer.eu/
12 years ago by @jaeschke
tags: social, computing
Welcome to iamResearcher
The Open Knowledge Network
11 years ago by @jaeschke
tags: web, social, researcher, network, gaw, research
AdamNation.org
where is tagging going?
19 years ago by @jaeschke
tags: tagging, social, folksonomy, taxonomy
SNARF from Microsoft Research
the Social Network and Relationship Finder
18 years ago by @jaeschke
tags: seminar2006, relationship, social, email, network, microsoft
foaf+ssl - ESW Wiki
FOAF+SSL is a secure authentication protocol that enables the building of distributed, open yet secure social networks.
15 years ago by @jaeschke
tags: authentication, dagsocial, system, social, profile, ssl, network, security, login, foaf
Social Semantic Cloud of Tags » SCOT: Let’s Share Tags!
The SCOT (Social Semantic Cloud of Tags) ontology aims to semantically represent the structure and semantics of a collection of tags, and to represent social networks among users based on those tags.
17 years ago by @jaeschke
tags: tagging, cloud, scot, semantic, web, social, folksonomy, tag, ontology
Statistical Physics of Social Dynamics
Statistical Physics of Social Dynamics: Opinions, Semiotic Dynamics, and Language
17 years ago by @jaeschke
tags: social, sna, summerschool, analysis, network
E-Valuation of Information Systems - LinkRank: Finding People of Similar Interests (e.g. on del.icio.us)
How to find people with similar interests on del.icio.us or flickr or other social software?
17 years ago by @jaeschke
tags: social, interest, folksonomy, ranking, community, linkrank, bookmarking
Social classes - Society - Life - ZEIT online
People move to good neighborhoods, send their children to private schools, and pay attention to style and manners: the bourgeoisie sets itself apart, making it harder for people from the lower classes to move up.
17 years ago by @jaeschke
tags: gesellschaft, social, sna, analysis, network, society
TP: Wikipedia from a sociological perspective: interview with network researcher Christian Stegbauer
After years of continuous growth, the German-language Wikipedia community is currently under criticism. In September, long-simmering discontent erupted over the deletion criteria, which are applied more restrictively here than in the more tolerant English-language original, which, for example, permits stubs, that is, rudimentary article beginnings. Recruitment of new authors is also stagnating. Internationally, there is even talk of a loss of authors that is said to have increased tenfold within a year. Creators of newly started articles in the German-language Wikipedia are often put off by deletions; anyone who works uninvited on articles whose structure was shaped by an established Wikipedian must expect fierce headwinds. On the other hand, the German-language Wikipedia is considered comparatively demanding, which is due not least to the attentiveness of the active Wikipedians, who suspiciously check every change within minutes. The internal power structures, which exclude numerous potential editors, were analyzed by the network researcher Christian Stegbauer in his book "Wikipedia. Das Rätsel der Kooperation" (2009).
15 years ago by @jaeschke
tags: social, wikipedia, interview, community
The Impact of Social Computing on the EU Information Society and Economy
http://ftp.jrc.es/EURdoc/JRC54327.pdf
15 years ago by @jaeschke
tags: social, impact, computing, eu
Collective Awareness
The Collective Awareness Platforms for Sustainability and Social Innovation (CAPS) are ICT systems leveraging the emerging "network effect" by combining open online social media, distributed knowledge creation and data from real environments (Internet of Things), in order to create new forms of social innovation. They are expected to support environmentally aware, grassroots processes and practices to share knowledge, to achieve changes in lifestyle, production and consumption patterns, and to set up more participatory democratic processes.
12 years ago by @jaeschke
tags: social, awareness, collective, mol, intelligence
ICWSM Datasets
http://icwsm.cs.mcgill.ca/
12 years ago by @jaeschke
tags: icwsm, web, dataset, twitter, social
Collective Awareness
http://ec.europa.eu/information_society/activities/collectiveawareness/links/index_en.htm
12 years ago by @jaeschke
tags: web, social, awareness, collective, caps, intelligence, platform
Knowledge and Data Engineering :: Data Sets
https://www.kde.cs.uni-kassel.de/datasets/
12 years ago by @jaeschke
tags: dataset, citation, social, bookmarking

publications (240)

Finding high-quality content in social media
E. Agichtein, C. Castillo, D. Donato, A. Gionis, and G. Mishne. Proceedings of the international conference on Web search and web data mining, pages 183--194. New York, NY, USA, ACM, (2008)
12 years ago by @jaeschke
tags: answering, web, social, question, collaborative, reputation, alexandria, media, search, quality
Can people collaborate to improve the relevance of search results?
A. Agrahri, D. Manickam, and J. Riedl. Proceedings of the 2008 ACM conference on Recommender systems, pages 283--286. New York, NY, USA, ACM, (2008)
12 years ago by @jaeschke
tags: web, social, collaborative, search
The jabberwocky programming environment for structured social computing
S. Ahmad, A. Battle, Z. Malkani, and S. Kamvar. Proceedings of the 24th annual ACM symposium on User interface software and technology, pages 53--64. New York, NY, USA, ACM, (2011)
12 years ago by @jaeschke
tags: cirg, social, computing, collective, human, intelligence, programming
Analysis of topological characteristics of huge online social networking services
Y. Ahn, S. Han, H. Kwak, S. Moon, and H. Jeong. Proceedings of the 16th International Conference on World Wide Web, pages 835--844. New York, NY, USA, ACM, (2007)
15 years ago by @jaeschke
tags: social, sna, folksonomy, online, analysis, network
Recommender Systems for the Social Web
J. Pazos Arias, A. Fernández Vilas, and R. Díaz Redondo (Eds.). Intelligent Systems Reference Library. Springer, Berlin/Heidelberg, (2012)
12 years ago by @jaeschke
tags: stair, system, web, social, recommender
Enhancing Social Interactions at Conferences
M. Atzmueller, D. Benz, S. Doerfel, A. Hotho, R. Jäschke, B. Macek, F. Mitzlaff, C. Scholz, and G. Stumme. Information Technology, 53 (3): 101--107 (May 2011)
13 years ago by @jaeschke
tags: myown, ubiquitous, conference, social, computing, 2011, conferator, rfid, network
Next Generation Web Search
R. Baeza-Yates and P. Raghavan. Search Computing, volume 5950 of Lecture Notes in Computer Science, chapter 2. Springer, Berlin/Heidelberg, (2010)
12 years ago by @jaeschke
tags: web, social, search
Everyone's an Influencer: Quantifying Influence on Twitter
E. Bakshy, J. Hofman, W. Mason, and D. Watts. Proceedings of the Fourth ACM International Conference on Web Search and Data Mining, pages 65--74. New York, NY, USA, ACM, (2011)
11 years ago by @jaeschke
tags: diffusion, toread, twitter, social, sna, analysis, network, influence
Social influence and the diffusion of user-created content
E. Bakshy, B. Karrer, and L. Adamic. Proceedings of the 10th ACM Conference on Electronic Commerce, pages 325--334. New York, NY, USA, ACM, (2009)
4 years ago by @jaeschke
tags: cascade, diffusion, twitter, social, sna, webscience, viral, analysis, network, influence
Recommender Systems for Social Tagging Systems
L. Balby Marinho, A. Hotho, R. Jäschke, A. Nanopoulos, S. Rendle, L. Schmidt-Thieme, G. Stumme, and P. Symeonidis. SpringerBriefs in Electrical and Computer Engineering. Springer, (February 2012)
12 years ago by @jaeschke
tags: tagging, myown, social, folksonomy, collaborative, 2012, recommender, bookmarking
