David Allison
UC Berkeley

What kinds of syntactic, semantic, or pragmatic information may be captured from a text document that could be most useful in classifying that document into one or more predefined categories or classes?

A general overview of automatic document classification will be presented, and suggestions will be taken regarding possible ways to use syntactic, semantic, and pragmatic information to classify documents into different categories.

Some information presented will be: