2 datasets found

Keywords: text cleaning

Filter Results
  • jusText

    jusText is a heuristic based boilerplate removal tool useful for cleaning documents in large textual corpora. The tool has been implemented in Python, licensed under New BSD...
  • CorpusExplorer

    Software for corpus linguists and text/data mining enthusiasts. The CorpusExplorer combines over 45 interactive visualizations under a user-friendly interface. Routine tasks...
You can also access this registry using the API (see API Docs).