Tuesday, January 29, 2008

Topic Tool

Today I put together a web page topic tool using the web service that Nathan Davis made available. The tool takes one or more web pages (i.e., a list of URLs) and extracts topics from the text on those pages. The topics, or more accurately the most likely topic components, are extracted using an algorithm called Latent Dirichlet Allocation (LDA).
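To give a rough feel for what LDA does with page text, here is a minimal sketch in plain Python. It is not the web service's code or API, which this post does not describe. It assumes the page text has already been fetched, and it uses a toy collapsed Gibbs sampler. The function names, the sample pages, the stopword list, and the parameters (`num_topics`, `alpha`, `beta`, `iterations`) are all illustrative assumptions.

```python
import random
import re

# Hypothetical stand-in for text already extracted from a list of URLs.
pages = [
    "Python code and software: writing code, testing code, shipping software.",
    "Software bugs: debugging Python code and fixing software tests.",
    "Baking bread: flour, yeast, and a hot oven make good bread.",
    "A bread recipe with flour and butter, baked in the oven.",
]

STOPWORDS = {"a", "and", "the", "in", "with", "make"}  # tiny illustrative list


def tokenize(text):
    """Lowercase the text and keep alphabetic words that are not stopwords."""
    return [w for w in re.findall(r"[a-z]+", text.lower()) if w not in STOPWORDS]


def lda_gibbs(docs, num_topics=2, alpha=0.1, beta=0.01, iterations=200, seed=0):
    """Toy collapsed Gibbs sampler for LDA over tokenized documents."""
    rng = random.Random(seed)
    vocab = sorted({w for doc in docs for w in doc})
    word_id = {w: i for i, w in enumerate(vocab)}
    vocab_size = len(vocab)

    # Count tables: document-topic, topic-word, and per-topic totals.
    doc_topic = [[0] * num_topics for _ in docs]
    topic_word = [[0] * vocab_size for _ in range(num_topics)]
    topic_total = [0] * num_topics

    # Start by assigning every token to a random topic.
    assignments = []
    for d, doc in enumerate(docs):
        z_doc = []
        for w in doc:
            z = rng.randrange(num_topics)
            z_doc.append(z)
            doc_topic[d][z] += 1
            topic_word[z][word_id[w]] += 1
            topic_total[z] += 1
        assignments.append(z_doc)

    for _ in range(iterations):
        for d, doc in enumerate(docs):
            for i, w in enumerate(doc):
                wid = word_id[w]
                z = assignments[d][i]
                # Remove this token's current assignment from the counts.
                doc_topic[d][z] -= 1
                topic_word[z][wid] -= 1
                topic_total[z] -= 1
                # Unnormalized conditional probability of each topic.
                weights = [
                    (doc_topic[d][k] + alpha)
                    * (topic_word[k][wid] + beta)
                    / (topic_total[k] + vocab_size * beta)
                    for k in range(num_topics)
                ]
                z = rng.choices(range(num_topics), weights=weights)[0]
                # Record the newly sampled assignment.
                assignments[d][i] = z
                doc_topic[d][z] += 1
                topic_word[z][wid] += 1
                topic_total[z] += 1

    return vocab, topic_word, doc_topic


def top_words(vocab, topic_word, n=3):
    """Return the n highest-count words for each topic."""
    result = []
    for row in topic_word:
        ranked = sorted(range(len(vocab)), key=lambda i: -row[i])
        result.append([vocab[i] for i in ranked[:n]])
    return result


docs = [tokenize(page) for page in pages]
vocab, topic_word, doc_topic = lda_gibbs(docs)
topics = top_words(vocab, topic_word)
for k, words in enumerate(topics):
    print(f"topic {k}: {', '.join(words)}")
```

Each printed line lists the highest-count words for one topic, which is roughly what "most likely topic components" means here. A real tool would also need to fetch the pages, strip the HTML, and use a much larger stopword list.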

One potential use is for quickly generating blogger profiles to be used for implicit affinity networks. You can try it out at:

http://dml.cs.byu.edu/matthewsmith/tools/topictool/

Nathan uses his web service to power GooEgg, a query expansion service for Google searches.
