> I am imagining a system that can parse papers from various sources > (web/files/etc) and in various formats (text, pdf, etc) and can store > metadata for this paper ,some kind of global ID if applicable, authors, > areas of research, whether the paper is "new", "highlighted", > "historical", type Those three categories won't help much. I'm sure though you had something specific in mind with them ? Karsten