2 datasets found

Keywords: blogs

  • Blog post and comment corpus Janes-Blog 1.0

    Janes-Blog is an annotated corpus of Slovene blogs from the websites rtvslo.si and publishwall.si, covering the period 2006-10 to 2016-01. The corpus is structured into individual texts...
  • Corpus of contemporary blogs

    At the NLP Centre, text is currently divided into sentences with a tool that uses a rule-based system. In order to make enough training data for machine learning, annotators...
You can also access this registry using the API (see API Docs).