DZ Interset

DZ Interset is a means of converting between tag sets in natural language processing. The core idea is similar to interlingua-based machine translation: DZ Interset defines a set of features that the various tag sets encode, and this feature set serves as the intermediate representation. The feature set should be as universal as possible. It does not need to capture everything that every tag set encodes, but it should capture all information that people may want to access or port from one tag set to another.

New tag sets are attached by writing a driver for them. Once the driver is ready, you can easily convert tags between the new set and any other set for which you also have a driver. This reusability is an obvious advantage over writing a targeted conversion procedure each time you need to convert between a particular pair of tag sets.
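
The driver idea can be illustrated with a minimal Python sketch. This is not the DZ Interset API: the `Driver` class, the `convert` function, the feature names `pos` and `number`, and the two toy tag sets `fine` and `coarse` are all hypothetical, chosen only to show how decoding into shared features and encoding back lets any two tag sets be paired without a dedicated converter.

```python
from typing import Dict

Features = Dict[str, str]


class Driver:
    """Hypothetical driver: maps the tags of one tag set to shared features and back."""

    def __init__(self, name: str, table: Dict[str, Features]) -> None:
        self.name = name
        self._table = {tag: dict(feats) for tag, feats in table.items()}

    def decode(self, tag: str) -> Features:
        """Return a copy of the feature structure for a tag of this set."""
        if tag not in self._table:
            raise KeyError(f"unknown tag {tag!r} in tag set {self.name!r}")
        return dict(self._table[tag])

    def encode(self, features: Features) -> str:
        """Return the tag of this set that best matches the features.

        Tags that contradict a given feature value are skipped. Among the
        rest, the tag sharing the most feature values wins; on a tie the
        tag listed first in the table is kept.
        """
        best_tag, best_score = None, -1
        for tag, tag_feats in self._table.items():
            conflict = any(
                key in features and features[key] != value
                for key, value in tag_feats.items()
            )
            if conflict:
                continue
            score = sum(1 for k, v in tag_feats.items() if features.get(k) == v)
            if score > best_score:
                best_tag, best_score = tag, score
        if best_tag is None:
            raise ValueError(f"no tag in {self.name!r} is compatible with {features}")
        return best_tag


def convert(tag: str, source: Driver, target: Driver) -> str:
    """Convert a tag by going through the shared feature structure."""
    return target.encode(source.decode(tag))


# Two toy tag sets (assumptions for illustration only).
fine = Driver("fine", {
    "NN": {"pos": "noun", "number": "sing"},
    "NNS": {"pos": "noun", "number": "plur"},
    "VB": {"pos": "verb"},
})
coarse = Driver("coarse", {
    "N": {"pos": "noun"},
    "V": {"pos": "verb"},
})

print(convert("NNS", fine, coarse))  # information about number is dropped
print(convert("N", coarse, fine))    # number is unknown, so a default tag is chosen
```

In this sketch, adding a third tag set means writing one more table, not one converter per pair. Converting from a coarser set to a finer one necessarily guesses the missing features; here the guess is simply the first compatible tag, which is a design choice of the sketch and not a claim about how DZ Interset resolves such cases.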

Identifier
PID http://hdl.handle.net/11858/00-097C-0000-0007-70FD-E
Related Identifier https://wiki.ufal.ms.mff.cuni.cz/user:zeman:interset
Metadata Access http://lindat.mff.cuni.cz/repository/oai/request?verb=GetRecord&metadataPrefix=oai_dc&identifier=oai:lindat.mff.cuni.cz:11858/00-097C-0000-0007-70FD-E
Provenance
Creator Zeman, Daniel
Publisher Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
Publication Year 2006
Rights GNU General Public License, version 2; http://www.gnu.org/licenses/gpl-2.0.html; PUB
OpenAccess true
Contact lindat-help(at)ufal.mff.cuni.cz
Representation
Resource Type toolService
Format application/zip; text/plain; charset=utf-8; downloadable_files_count: 1
Discipline Linguistics