File name holds temporal information.
dBdata files: contains gathered tweets, inferred language and their corresponding cluster identifier
clusterID : cluster identifier
tweetID : tweet identifier
lang : language detected
result files : holds the results of the TF-IDF, Pagerank and LDA applied. These are the extracted n-grams, ordered and sepparated by \";\"
clusterID : cluster identifier
tfidfRES : tfidf result
pagerankRES : pagerank result
ldaRES : lda result
These files were generated by a continuous pattern recognition on db data
http://hdl.handle.net/11304/93e8b144-5ada-46bb-974f-786b21b1ab06