Further details on the data are in the description.
description | structure | granularity | format | comments |
---|---|---|---|---|
Unknown structure | <empty> | <empty> | <empty> | any data; e.g., original data |
Documents aligned at file level | document | file | dirlang | metadata directory for certain formats |
Documents aligned at paragraph level | document | paragraph | dirlang | format in file extension |
Documents aligned at sentence level | document | sentence | sqlite | format in file extension or field |
Documents aligned at subsentence level | document | subsentence | dirlang | |
Segments aligned at paragraph level in the format TMX | segment | paragraph | tmx | |
Segments aligned at sentence level in the format TMX | segment | sentence | tmx | |
Segments aligned at subsentence level in the format TMX | segment | subsentence | tmx | Same content, different format |
Segments aligned at subsentence level in the format column-file | segment | subsentence | columnfile | Same content, different format |
Segments aligned at subsentence level in the format SQLite | segment | subsentence | sqlite | Same content, different format |
dirlang
: directory with language codes
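
The classification in the table can be read as a small lookup structure keyed by structure, granularity, and format. The following is a minimal sketch of that reading in Python, not part of any existing tool: the names `DataType`, `DATA_TYPES`, and `formats_for` are hypothetical and chosen for illustration only, and the `<empty>` cells are assumed to map to `None`.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class DataType:
    """One row of the table; None stands for an <empty> cell (assumption)."""
    description: str
    structure: Optional[str]
    granularity: Optional[str]
    fmt: Optional[str]


# Rows copied from the table above, in the same order.
DATA_TYPES: List[DataType] = [
    DataType("Unknown structure", None, None, None),
    DataType("Documents aligned at file level", "document", "file", "dirlang"),
    DataType("Documents aligned at paragraph level", "document", "paragraph", "dirlang"),
    DataType("Documents aligned at sentence level", "document", "sentence", "sqlite"),
    DataType("Documents aligned at subsentence level", "document", "subsentence", "dirlang"),
    DataType("Segments aligned at paragraph level in the format TMX", "segment", "paragraph", "tmx"),
    DataType("Segments aligned at sentence level in the format TMX", "segment", "sentence", "tmx"),
    DataType("Segments aligned at subsentence level in the format TMX", "segment", "subsentence", "tmx"),
    DataType("Segments aligned at subsentence level in the format column-file", "segment", "subsentence", "columnfile"),
    DataType("Segments aligned at subsentence level in the format SQLite", "segment", "subsentence", "sqlite"),
]


def formats_for(structure: str, granularity: str) -> List[str]:
    """Return the sorted formats listed for a structure/granularity pair."""
    return sorted(
        {
            d.fmt
            for d in DATA_TYPES
            if d.structure == structure
            and d.granularity == granularity
            and d.fmt is not None
        }
    )


if __name__ == "__main__":
    # Subsentence-level segments are the only pair listed in three formats.
    print(formats_for("segment", "subsentence"))
```

Under these assumptions, a pair that the table does not list, such as `formats_for("segment", "file")`, yields an empty list.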