Further details on the data are in the description.
| description | structure | granularity | format | comments |
|---|---|---|---|---|
| Unknown structure | <empty> | <empty> | <empty> | any data; e.g., original data |
| Documents aligned at file level | document | file | dirlang | metadata directory for certain formats |
| Documents aligned at paragraph level | document | paragraph | dirlang | format in file extension |
| Documents aligned at sentence level | document | sentence | sqlite | format in file extension or field |
| Documents aligned at subsentence level | document | subsentence | dirlang | |
| Segments aligned at paragraph level in the format TMX | segment | paragraph | tmx | |
| Segments aligned at sentence level in the format TMX | segment | sentence | tmx | |
| Segments aligned at subsentence level in the format TMX | segment | subsentence | tmx | Same content, different format |
| Segments aligned at subsentence level in the format column-file | segment | subsentence | columnfile | Same content, different format |
| Segments aligned at subsentence level in the format SQLite | segment | subsentence | sqlite | Same content, different format |

dirlang: directory with language codes
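
The three classifying columns (structure, granularity, format) can be treated as a small record from which the description is derived. The following is a minimal Python sketch of that idea, not part of any existing tool: the names `DataType`, `describe`, and `FORMAT_LABELS` are hypothetical, and the rule that only segment-structured data names its format in the description is an assumption read off the table rows.

```python
from dataclasses import dataclass
from typing import Optional

# Display names for format codes (assumed from the table's wording).
FORMAT_LABELS = {"tmx": "TMX", "columnfile": "column-file", "sqlite": "SQLite"}


@dataclass(frozen=True)
class DataType:
    """Hypothetical record for one row of the table; None stands for <empty>."""
    structure: Optional[str] = None    # "document" or "segment"
    granularity: Optional[str] = None  # "file", "paragraph", "sentence", "subsentence"
    format: Optional[str] = None       # "dirlang", "tmx", "columnfile", "sqlite"


def describe(dt: DataType) -> str:
    """Build a description string in the style of the table's first column."""
    if dt.structure is None:
        return "Unknown structure"
    text = f"{dt.structure.capitalize()}s aligned at {dt.granularity} level"
    # Assumption: only segment rows mention the format in their description.
    if dt.structure == "segment":
        text += f" in the format {FORMAT_LABELS.get(dt.format, dt.format)}"
    return text


if __name__ == "__main__":
    print(describe(DataType("segment", "sentence", "tmx")))
    # prints: Segments aligned at sentence level in the format TMX
```

Keeping the record immutable (`frozen=True`) makes it usable as a dictionary key, which may be convenient when grouping datasets by type.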