HOME    »    PROGRAMS/ACTIVITIES    »    Annual Thematic Program
Talk Abstract
Online Text Classification with ATTICS

David D. Lewis
AT&T Labs
lewis@research.att.com

 

ATTICS is a C++ platform implemented at AT&T Labs for training and use of predictive models on mixed text and nontext data. In spirit it is a hybrid between text retrieval systems such as SMART and machine learning toolkits such as MLC++. The design, data model, and emphasis on online classifier application are unusual for either type of software. The term weighting and supervised learning techniques used in information retrieval were developed in the context of ranked retrieval from relatively static text databases. I will discuss how we implemented these techniques in the online setting of ATTICS, and the research questions this exercise raises.


Back to Workshop Schedule

Back to IMA "HOT TOPICS" Workshop: Text Mining

Go