MLW-LT

XLIFF/MT Round-Tripping


Demonstration, MLW Workshop, Rome, Luxembourg, March 12-15, 2013

A test site with the XLIFF/MT round-trip web service is available.

  • Implemented as part of the Solas workflow (web-service based, with its own WSDL)
  • XLIFF in, XLIFF out
  • Processes ITS 2.0 metadata mapped into XLIFF
  • Consumes the data categories Translate, Domain, and Text Analysis
  • Generates metadata for the data categories Provenance and MT Confidence

 

Inputs

Translate
    <trans-unit id="#4">
        <source xml:lang="English">
            The first 'Classic' writer was Aulus Gellius a 2nd-century Roman writer who 
            in the miscellany Noctes Atticae (19  8  15) refers to a writer as a
            <mrk mtype="protected">Classicus scriptor  non proletarius</mrk> 
            ('A distinguished  not a commonplace writer').
        </source>
    </trans-unit>
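
One way a client could keep `<mrk mtype="protected">` spans away from the MT engine is to swap them for placeholders before translation and restore them afterwards. The sketch below illustrates that idea only; the helper names `mask_protected` and `unmask` and the `__P0__` placeholder format are assumptions, not part of the service. It restores the protected text only and does not re-wrap it in `<mrk>`.

```python
import xml.etree.ElementTree as ET


def mask_protected(source):
    """Return (text_for_mt, protected_spans) for a <source> element.

    Each <mrk mtype="protected"> span is replaced by a numbered placeholder.
    """
    parts = [source.text or ""]
    protected = []
    for child in source:
        content = "".join(child.itertext())
        if child.tag == "mrk" and child.get("mtype") == "protected":
            protected.append(content)
            parts.append(f"__P{len(protected) - 1}__")
        else:
            parts.append(content)
        parts.append(child.tail or "")
    return "".join(parts), protected


def unmask(translated, protected):
    """Put the protected spans back into the translated string."""
    for index, text in enumerate(protected):
        translated = translated.replace(f"__P{index}__", text)
    return translated


unit = ET.fromstring("""<trans-unit id="#4">
  <source xml:lang="English">refers to a writer as a <mrk
    mtype="protected">Classicus scriptor non proletarius</mrk> today</source>
</trans-unit>""")

masked, spans = mask_protected(unit.find("source"))
# A real MT call would go here; this string stands in for its result.
restored = unmask("se refiere a un escritor como __P0__ hoy", spans)
```
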

 
Domain
    <trans-unit id="#1" itsx:domain="classical-studies">
        <source xml:lang="English">Classical Studies</source>
    </trans-unit>
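
The page does not say how the service uses the Domain value. One plausible use is choosing a domain-tuned MT engine, sketched below. The engine names, the `pick_engine` helper, and the namespace URI bound to `itsx` are all placeholders, since the source does not give them.

```python
import xml.etree.ElementTree as ET

# Placeholder: the real namespace URI for the itsx prefix is not given in the source.
ITSX_NS = "urn:example:itsx"

# Hypothetical mapping from domain labels to MT engine identifiers.
ENGINES = {"classical-studies": "engine-humanities"}
DEFAULT_ENGINE = "engine-general"


def pick_engine(unit):
    """Choose an engine from the itsx:domain attribute of a <trans-unit>."""
    domain = unit.get(f"{{{ITSX_NS}}}domain", "")
    return ENGINES.get(domain.strip().lower(), DEFAULT_ENGINE)


with_domain = ET.fromstring(f"""<trans-unit xmlns:itsx="{ITSX_NS}" id="#1"
    itsx:domain="classical-studies">
  <source xml:lang="English">Classical Studies</source>
</trans-unit>""")
without_domain = ET.fromstring('<trans-unit id="#9"><source>Hello</source></trans-unit>')

engine = pick_engine(with_domain)
fallback = pick_engine(without_domain)
```
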

 
Text Analysis
<trans-unit id="#3" itsx:domain="ITS-Example">
    <source xml:lang="en-us">
        From the canyons of 
        <mrk mtype="x-its" its:taConfidence="0.7" its:taClassRef="http://nerd.eurecom.fr/ontology#Place" 
          its:taIdentRef="http://dbpedia.org/resource/Arizona">Arizona</mrk>, 
        to the Khmer temples deep in the jungle; from the tropical beaches of Queensland to the glaciers of 
        <mrk mtype="x-its" its:taConfidence="0.7" its:taClassRef="http://nerd.eurecom.fr/ontology#Place" 
          its:taIdentRef="http://dbpedia.org/resource/Antarctica">Antarctica</mrk>; 
        or from the wild savanna of 
        <mrk mtype="x-its" its:taConfidence="0.7" its:taClassRef="http://nerd.eurecom.fr/ontology#Place" 
          its:taIdentRef="http://dbpedia.org/resource/Africa">Africa</mrk>
        to mysterious castles in the forests of Bohemia, our offer takes you in some of the most amazing 
        places on our planet.
    </source>
</trans-unit>

 

Output

The XLIFF file is enriched with MT output (in the <alt-trans> element).
 
Translate (output)
<trans-unit id="#4">
    <source xml:lang="English">
        The first 'Classic' writer was Aulus Gellius a 2nd-century Roman writer 
        who in the miscellany Noctes Atticae (19 8 15) refers to a writer as 
        a <mrk mtype="protected">Classicus scriptor non proletarius</mrk>
        ('A distinguished not a commonplace writer'). 
    </source>
    <alt-trans match-quality="0.749" origin="MT" its:provenanceRecordsRef="#pr3"
                its:annotatorsRef="mtconfidence|http://mlwlt.moravia.com/mlwlt-service-xliff-mt/mlwlt-service.asmx">
        <target xml:lang="Spanish">
            La primera 'Classic' Writer era aulus gellius un 2nd-century Roman Writer 
            que en la miscellany noctes atticae (19 8 15) refers a una Writer como un
            <mrk mtype="protected">Classicus scriptor non proletarius</mrk>
            ('un Distinguished no una moneda corriente Writer').
        </target>
    </alt-trans>
</trans-unit>

 
Text Analysis (output)
When Text Analysis metadata is present, the translation of the annotated term is taken from the referenced resource instead of from MT.
<trans-unit id="#3" itsx:domain="ITS-Example">
    <source xml:lang="en-us">
        From the canyons of 
        <mrk mtype="x-its" its:taConfidence="0.7" its:taClassRef="http://nerd.eurecom.fr/ontology#Place" 
          its:taIdentRef="http://dbpedia.org/resource/Arizona">Arizona</mrk>, 
        to the Khmer temples deep in the jungle; from the tropical beaches of Queensland to the glaciers of 
        <mrk mtype="x-its" its:taConfidence="0.7" its:taClassRef="http://nerd.eurecom.fr/ontology#Place" 
          its:taIdentRef="http://dbpedia.org/resource/Antarctica">Antarctica</mrk>; 
        or from the wild savanna of 
        <mrk mtype="x-its" its:taConfidence="0.7" its:taClassRef="http://nerd.eurecom.fr/ontology#Place" 
          its:taIdentRef="http://dbpedia.org/resource/Africa">Africa</mrk>
        to mysterious castles in the forests of Bohemia, our offer takes you in some of the most amazing 
        places on our planet.
    </source>
    <alt-trans mid="0" match-quality="0.749" origin="MT" its:provenanceRecordsRef="#pr3"
                its:annotatorsRef="mtconfidence|http://mlwlt.moravia.com/mlwlt-service-xliff-mt/mlwlt-service.asmx">
        <source xml:lang="en-us">
            From the canyons of ...
        </source>
        <target xml:lang="fr-fr">
            Des canyons de 
            <mrk mtype="x-its" its:taConfidence="0.7" its:taClassRef="http://nerd.eurecom.fr/ontology#Place" 
              its:taIdentRef="http://dbpedia.org/resource/Arizona">Arizona</mrk>,
            à la Khmers temples profonde dans la jungle tropicale; des plages du Queensland à les glaciers de 
            <mrk mtype="x-its" its:taConfidence="0.7" its:taClassRef="http://nerd.eurecom.fr/ontology#Place" 
              its:taIdentRef="http://dbpedia.org/resource/Antarctica">Antarctique</mrk> 
            à la vie sauvage de savanna 
            <mrk mtype="x-its" its:taConfidence="0.7" its:taClassRef="http://nerd.eurecom.fr/ontology#Place" 
              its:taIdentRef="http://dbpedia.org/resource/Africa">Afrique</mrk> 
            de mystérieux châteaux dans les forêts de la Bohème, notre offre vous prenne dans certains 
            endroits les plus sensationnels sur notre planète.
        </target>
    </alt-trans>
</trans-unit>
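
The term substitution shown above can be sketched as a lookup keyed by `its:taIdentRef`. The `LABELS` table here is a hypothetical stand-in for whatever the service retrieves from the referenced resource, and `localize_terms` is an illustrative name, not the service's API. Terms without a known label are left unchanged.

```python
import copy
import xml.etree.ElementTree as ET

ITS_NS = "http://www.w3.org/2005/11/its"
TA_IDENT_REF = f"{{{ITS_NS}}}taIdentRef"

# Hypothetical stand-in for labels fetched from the referenced resource.
LABELS = {
    ("http://dbpedia.org/resource/Antarctica", "fr"): "Antarctique",
    ("http://dbpedia.org/resource/Africa", "fr"): "Afrique",
}


def localize_terms(segment, lang, labels=LABELS):
    """Return a copy of segment whose annotated terms use looked-up labels."""
    result = copy.deepcopy(segment)
    for mrk in result.iter("mrk"):
        label = labels.get((mrk.get(TA_IDENT_REF), lang))
        if label is not None:
            mrk.text = label  # assumes the <mrk> holds plain text only
    return result


source = ET.fromstring(f"""<source xmlns:its="{ITS_NS}" xml:lang="en-us">the glaciers of <mrk
    mtype="x-its" its:taIdentRef="http://dbpedia.org/resource/Antarctica">Antarctica</mrk>
  and the canyons of <mrk
    mtype="x-its" its:taIdentRef="http://dbpedia.org/resource/Arizona">Arizona</mrk></source>""")

localized = localize_terms(source, "fr")
terms = [mrk.text for mrk in localized.iter("mrk")]
original_terms = [mrk.text for mrk in source.iter("mrk")]
```
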

 
MT Confidence (output)
<trans-unit id="#4">
    <source xml:lang="English">
        The first 'Classic' writer was Aulus Gellius a 2nd-century Roman writer 
        who in the miscellany Noctes Atticae (19 8 15) refers to a writer as 
        a <mrk mtype="protected">Classicus scriptor non proletarius</mrk>
        ('A distinguished not a commonplace writer'). 
    </source>
    <alt-trans match-quality="0.749" origin="MT" its:provenanceRecordsRef="#pr3"
                its:annotatorsRef="mtconfidence|http://mlwlt.moravia.com/mlwlt-service-xliff-mt/mlwlt-service.asmx">
        <target xml:lang="Spanish">
            La primera 'Classic' Writer era aulus gellius un 2nd-century Roman Writer 
            que en la miscellany noctes atticae (19 8 15) refers a una Writer como un
            <mrk mtype="protected">Classicus scriptor non proletarius</mrk>
            ('un Distinguished no una moneda corriente Writer').
        </target>
    </alt-trans>
</trans-unit>
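
A consumer of this output might read the confidence score and the annotator reference as sketched below. This is a minimal sketch: it assumes `match-quality` holds a plain decimal as in the sample (XLIFF also permits other forms, such as percentages, which are not handled), and `mt_candidates` is an illustrative name.

```python
import xml.etree.ElementTree as ET

ITS_NS = "http://www.w3.org/2005/11/its"


def mt_candidates(unit, min_quality=0.0):
    """List MT proposals in a <trans-unit> whose score is at least min_quality."""
    found = []
    for alt in unit.findall("alt-trans"):
        if alt.get("origin") != "MT":
            continue
        quality = float(alt.get("match-quality", "0"))
        if quality < min_quality:
            continue
        # its:annotatorsRef has the form "data-category|annotator-IRI".
        ref = alt.get(f"{{{ITS_NS}}}annotatorsRef", "")
        category, _, annotator = ref.partition("|")
        target = alt.find("target")
        text = "" if target is None else " ".join("".join(target.itertext()).split())
        found.append({"quality": quality, "category": category,
                      "annotator": annotator, "text": text})
    return found


unit = ET.fromstring(f"""<trans-unit xmlns:its="{ITS_NS}" id="#4">
  <source xml:lang="English">The first 'Classic' writer</source>
  <alt-trans match-quality="0.749" origin="MT"
      its:annotatorsRef="mtconfidence|http://mlwlt.moravia.com/mlwlt-service-xliff-mt/mlwlt-service.asmx">
    <target xml:lang="Spanish">La primera 'Classic' Writer</target>
  </alt-trans>
</trans-unit>""")

accepted = mt_candidates(unit, min_quality=0.5)
rejected = mt_candidates(unit, min_quality=0.8)
```
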

 
Provenance record (output)
<xliff>
    <file>
        <header>
            <its:provenanceRecords xml:id="pr1">
                <its:provenanceRecord provRef="http://www.cngl.ie/logger/logs/3f37155d-6f01-4abd-a7c0-87d64fc9f0fa" />
            </its:provenanceRecords>
            <its:provenanceRecords xml:id="pr2">
                <its:provenanceRecord provRef="http://www.cngl.ie/logger/logs/501bae14-3b8d-4b15-875e-7c442f2e1b1e" />
            </its:provenanceRecords>
            <its:provenanceRecords xml:id="pr3">
                <its:provenanceRecord tool="mosesmt" orgRef="http://www.moravia.com" provRef=""/>
            </its:provenanceRecords>
            .
            .
            .
<trans-unit id="#4">
    <source xml:lang="English">
        The first 'Classic' writer was Aulus Gellius a 2nd-century Roman writer 
        who in the miscellany Noctes Atticae (19 8 15) refers to a writer as 
        a <mrk mtype="protected">Classicus scriptor non proletarius</mrk>
        ('A distinguished not a commonplace writer'). 
    </source>
    <alt-trans match-quality="0.749" origin="MT" its:provenanceRecordsRef="#pr3"
                its:annotatorsRef="mtconfidence|http://mlwlt.moravia.com/mlwlt-service-xliff-mt/mlwlt-service.asmx">
        <target xml:lang="Spanish">
            La primera 'Classic' Writer era aulus gellius un 2nd-century Roman Writer 
            que en la miscellany noctes atticae (19 8 15) refers a una Writer como un
            <mrk mtype="protected">Classicus scriptor non proletarius</mrk>
            ('un Distinguished no una moneda corriente Writer').
        </target>
    </alt-trans>
</trans-unit>
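
The link between an `<alt-trans>` and its provenance record can be followed as sketched below: take the fragment identifier from `its:provenanceRecordsRef` and find the `<its:provenanceRecords>` element in the header with that `xml:id`. The function name `resolve_provenance` and the abbreviated sample document are assumptions made for illustration. Attribute names are returned without any namespace, so prefixed and unprefixed spellings are treated alike.

```python
import xml.etree.ElementTree as ET

ITS_NS = "http://www.w3.org/2005/11/its"
XML_ID = "{http://www.w3.org/XML/1998/namespace}id"


def resolve_provenance(root, alt_trans):
    """Return the provenance records referenced by an <alt-trans> element."""
    ref = alt_trans.get(f"{{{ITS_NS}}}provenanceRecordsRef", "")
    if not ref.startswith("#"):
        return []  # only same-document references are handled in this sketch
    wanted = ref[1:]
    for records in root.iter(f"{{{ITS_NS}}}provenanceRecords"):
        if records.get(XML_ID) == wanted:
            return [
                {name.rsplit("}", 1)[-1]: value for name, value in rec.attrib.items()}
                for rec in records.findall(f"{{{ITS_NS}}}provenanceRecord")
            ]
    return []


doc = ET.fromstring(f"""<xliff xmlns:its="{ITS_NS}"><file>
  <header>
    <its:provenanceRecords xml:id="pr1">
      <its:provenanceRecord provRef="http://www.cngl.ie/logger/logs/3f37155d-6f01-4abd-a7c0-87d64fc9f0fa"/>
    </its:provenanceRecords>
    <its:provenanceRecords xml:id="pr3">
      <its:provenanceRecord tool="mosesmt" orgRef="http://www.moravia.com"/>
    </its:provenanceRecords>
  </header>
  <body>
    <trans-unit id="#4">
      <alt-trans origin="MT" its:provenanceRecordsRef="#pr3"/>
    </trans-unit>
  </body>
</file></xliff>""")

records = resolve_provenance(doc, doc.find(".//alt-trans"))
```
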