Tuning your PostgreSQL database is somewhat of a black art. While documentation does exist on the topic, many people still find it hard to get all the power out of their system. This article aims to help demystify PostgreSQL database performance tuning.
Query log data for ad targeting
A WWW2006 paper out of Microsoft Research, "Finding Advertising Keywords on Web Pages" (PDF), claims that query log data is particularly useful for ad targeting.
Specifically, the researchers extracted from MSN query logs the keywords some people used to find a given page. They tested using that as one of many features for ad targeting. In their results, it was one of the most effective features.
Very interesting. It has always been harder to target ads to content than to search results because intent is much less clear.
By using the query log data in this way, the researchers were effectively using the intent of the searchers that arrived at the page as a proxy for the intent of everyone who arrived at the page.
select min(seq) seq,state,count(*) numb_ops, -> round(sum(duration),5) sum_dur, round(avg(duration),5) avg_dur, -> round(sum(cpu_user),5) sum_cpu, round(avg(cpu_user),5) avg_cpu -> from information_schema.profiling -> where query_id = 7 -> group by state -> order by seq;
R. Baeza-Yates, and A. Tiberi. KDD '07: Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining, page 76--85. New York, NY, USA, ACM, (2007)
R. Baeza-Yates, C. Hurtado, and M. Mendoza. Current Trends in Database Technology - EDBT 2004 Workshops, volume 3268 of Lecture Notes in Computer Science, Springer Berlin Heidelberg, (2005)
A. Marcus, E. Wu, S. Madden, and R. Miller. Proceedings of the 5th Biennial Conference on Innovative Data Systems Research, page 211--214. CIDR, (January 2011)
B. Krause, A. Hotho, and G. Stumme. Advances in Information Retrieval, 30th European Conference on IR Research, ECIR 2008, 4956, page 101-113. Springer, (2008)