from french to #corsicanlanguage: scoring 1 – 11/128 = 91%. (rough) typology of remaining errors:
– grammatical errors (partitive article, auxiliary reversal, locutions, etc.: 35% (difficulty: medium)
– lack of vocabulary: 25% (easy)
– semantic disambiguation: 20% (hard)
– grammatical disambiguation: 10% (medium)
– false positives: for example proper names that are translated instead of remaining untranslated: 10% (medium)
– formal errors (punctuation, uppercase/lowercase, elision, …): 10% (medium)
– other grammatical errors: 10% (medium to hard)
Monthly Archives: June 2016
June 24, 2016: scoring 90%
Testing rule-based translation : scoring 1 – 13/137 = 90%
some lack of vocabulary
but mostly grammatical errors: partitive article (french ‘de’), etc.
June 6, 2016: Scoring 91%
testing #machine translation from french to corsican. Scoring 1 – 12 / 136 = 91%.
Of interest:
– XVIe seculu (fr) = 16th century = XVIu seculu / seculu XVI (co)
– caution: sample from fr wk’s first page is convenient for its randomness, but
can not be considered as truly random text: some verb tenses are missing
in this type of articles, conversation and dialogs are scarce
June 2, 2016: scoring 88%
Sample test, interesting stuff:
– scoring 1 – 13/110 = 88% on this sample
– hard case: disambiguation of fils (fr) = son/sons (en= ) figliolu/figlioli (co)
– caution: sample from fr wk’s first page is convenient for its randomness, but
can not be considered as truly random text: some verb tenses are missing in this type of articles, conversation and dialogs are scarce