User:Martind/Document Log Discovery Platform: Difference between revisions

From London Hackspace Wiki

Line 1: Line 1:
== Problem Statement ==
== Problem Statement ==


We're seeing an increase in the publication of vast corpuses of document logs, often in the form of message archives, usually in a structured message format. They're all quite overwhelming: how to make sense of such a vast amount of text? How to identify sections that are relevant?
We're seeing an increase in the publication of vast corpuses of document logs, often in the form of message archives, usually in a structured message format. They're all quite overwhelming: how to make sense of such a vast amount of text? How to identify sections that are relevant? When identified with a new corpus, how can I see what other people already found? How can I make my own findings available to others?


* Can we allow large number of interested parties to annotate these documents?
* Can we allow large number of interested parties to annotate these documents?