Moses-support Digest, Vol 85, Issue 20

Send Moses-support mailing list submissions to
moses-support@mit.edu

To subscribe or unsubscribe via the World Wide Web, visit
http://mailman.mit.edu/mailman/listinfo/moses-support
or, via email, send a message with subject or body 'help' to
moses-support-request@mit.edu

You can reach the person managing the list at
moses-support-owner@mit.edu

When replying, please edit your Subject line so it is more specific
than "Re: Contents of Moses-support digest..."


Today's Topics:

1. 1st CFP: Computer Speech and Language Special Issue on Hybrid
Machine Translation (Marta Ruiz)
2. Re: 10 years of OPUS (Matthias Huck)
3. Re: Syntax model in source side (Hieu Hoang)
4. Re: alignment info (Hieu Hoang)


----------------------------------------------------------------------

Message: 1
Date: Wed, 6 Nov 2013 12:05:56 -0600
From: Marta Ruiz <martaruizcostajussa@gmail.com>
Subject: [Moses-support] 1st CFP: Computer Speech and Language Special
Issue on Hybrid Machine Translation
To: Marta Ruiz <martaruizcostajussa@gmail.com>
Message-ID:
<CABEBqHJLiyMPgmekDVqTPfLG5gvbijt5zqxJ3kT+ccydCmaZ7g@mail.gmail.com>
Content-Type: text/plain; charset="iso-8859-1"

Hybrid Machine Translation focuses on combining the best properties of
different Machine Translation (MT) paradigms. Nowadays, it is very popular
to include linguistic features in Statistical Machine Translation (SMT)
systems or to modify the standard Rule-based Machine Translation (RBMT)
architecture to include statistical knowledge. Other ways of hybridization
include the system combination techniques which may combine a rich variety
of translation paradigms.

This special issue in the prestigious Computer Speech and Language journal
will cover different approaches from Hybrid Machine Translation and take
advantage of the latest and leading research to discuss the progress in the
field.

This special issue invites contributions related to (but not limited):

1.

theoretical and experimental design of hybrid MT architectures
2.

experimental results with hybrid MT systems guided by corpus-based or
rule-based systems
3.

introduction of linguistics in corpus-based approaches
4.

rule-based systems extended or built with statistical information
5.

induction of lexical or grammatical transfer rules from corpora
6.

description of open source tools and language resources for hybrid MT
7.

description of computationally efficient algorithms for hybrid MT
8.

applications of hybrid MT systems
9.

hybrid methods applied to spoken language translation (SLT)
10.

hybrid evaluation methods
11.

system combination of different MT and SLT paradigms.

*Submission procedure*
Prospective authors should follow the regular guidelines of the Computer
Speech and Language Journal for electronic submission
(*http://ees.elsevier.com/csl
<http://ees.elsevier.com/csl>*). During submission authors must select "SI:
Hybrid Machine Translation" as Article Type.

*Important dates: *
1st March 2014: Deadline for Submissions
1st June 2014: Notification of Resubmission
1st July 2014: Deadline for Resubmission
1st August 2014: Final Decision


*Guest Editors:*
Dr. Marta R. Costa-juss?, Institute for Infocomm Research, Singapore,
*martaruizcostajussa@gmail.com
<martaruizcostajussa@gmail.com>*
Prof. Jos? A. R. Fonollosa, Universitat Polit?cnica de Catalunya,
Barcelona, *jose.fonollosa@upc.edu <jose.fonollosa@upc.edu>*
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mailman.mit.edu/mailman/private/moses-support/attachments/20131106/6307f1b7/attachment-0001.htm

------------------------------

Message: 2
Date: Wed, 06 Nov 2013 18:42:58 +0000
From: Matthias Huck <mhuck@inf.ed.ac.uk>
Subject: Re: [Moses-support] 10 years of OPUS
To: moses-support@mit.edu
Message-ID: <1383763378.20373.154.camel@portedgar>
Content-Type: text/plain; charset="UTF-8"

Hi James,

There has been a vast literature on adaptation techniques for SMT in
recent years.

Some reading suggestions:

http://www.statmt.org/wmt07/pdf/WMT17.pdf
http://www.statmt.org/wmt09/pdf/WMT-0932.pdf
http://dl.acm.org/citation.cfm?id=1870702
http://www.mt-archive.info/IWSLT-2011-Bisazza.pdf
http://www.aclweb.org/anthology/P12-1099
http://amta2012.amtaweb.org/AMTA2012Files/papers/115.pdf
http://amta2012.amtaweb.org/AMTA2012Files/papers/152.pdf
http://www.hltpr.rwth-aachen.de/publications/download/832/Mansour-IWSLT-2012.pdf
http://www.aclweb.org/anthology/P/P13/P13-1141.pdf

See also http://www.statmt.org/survey/Topic/DomainAdaptation .

You'll be able to find many more interesting papers about that topic.
Not sure to what extent your idea differs from what has been suggested
in previous work.

Regarding your question about distortion, you may also want to consult
the literature first. Philipp Koehn wrote a nice textbook about all the
basics of SMT: http://www.statmt.org/book/

Cheers,
Matthias


On Wed, 2013-11-06 at 09:35 -0800, Kenneth Heafield wrote:
> Hi,
>
> Multiple column won't really work because the set of phrase pairs will
> be different. You could of course take the union of phrase pairs and
> just have null values for inapplicable phrases, but it's not clear how
> much compression you'd get.
>
> Kenneth
>
> On 11/06/13 06:21, Read, James C wrote:
> > So here's a random crazy idea I had lately. A phrase table could have multiple columns giving different scores for different probabilities from different alignments, different corpora, different domains etc. Recent work at Edinburgh, Cambridge and Sheffield has had some emphasis on adaptation of models for speech recognition purposes. I guess a similar principle could be applied to SMT. Given a text from some unknown domain the engine could perform some automated recognition test to guess which translation model best fits the text to be translated. A primitive form of automatic domain recognition and adaptation if you like.
> >
> > I guess even making available multiple forms of a phrase table or a single compact version with multiple columns for scoring could even have some demand in the future.
> >
> > James



--
The University of Edinburgh is a charitable body, registered in
Scotland, with registration number SC005336.



------------------------------

Message: 3
Date: Wed, 6 Nov 2013 18:46:44 +0000
From: Hieu Hoang <Hieu.Hoang@ed.ac.uk>
Subject: Re: [Moses-support] Syntax model in source side
To: burak ayd?n <baydinx@gmail.com>, moses-support
<moses-support@mit.edu>
Message-ID:
<CAEKMkbiq1hunEfeMtVwJhuJSkWgoW-tKaLQ2o79PBm1Nbbt=cw@mail.gmail.com>
Content-Type: text/plain; charset="utf-8"

i don't think the EMS supports source-syntax models. You will have to train
the models manually.

Don't be surprised if the BLEU is low, most syntax systems achieve lower
bleu score than phrase-based and hiero models.



On 4 November 2013 14:50, burak ayd?n <baydinx@gmail.com> wrote:

> Hi everyone,
>
> I want to use Collins parser while translating from En. I checked the
> sample ems configs and applied it. The experiment did not crash or get any
> error, but bleu scores were dramatically low, implying that there must be
> something wrong. Here the additional parameters for sytnax with Collins' :
>
> #syntactic parsers
> input-parser = "$moses-script-dir/training/wrappers/parse-en-collins.perl
> -collins /usr/local/smt/COLLINS-PARSER -mxpost /usr/local/smt/MXPOST/ "
>
> #training options
> training-options = "-mgiza -mgiza-cpus 4 -sort-buffer-size 8G
> -sort-compress gzip -sort-parallel 4 -cores 4 -source-syntax"
>
> Do I need additional parameters except the ones above? I would appreciate
> any help.
>
> Thanks
>
>
> 2013/11/4 burak ayd?n <baydinx@gmail.com>
>
>> Hi everyone,
>>
>> I want to use Collins parser while translating from En. I checked the
>> sample ems configs and applied it. The experiment did not crash or get any
>> error, but bleu scores were dramatically low, implying that there must be
>> something wrong. Here the additional parameters for sytnax with Collins' :
>>
>> #syntactic parsers
>> input-parser = "$moses-script-dir/training/wrappers/parse-en-collins.perl
>> -collins /usr/local/smt/COLLINS-PARSER -mxpost /usr/local/smt/MXPOST/ "
>>
>> #training options
>> training-options = "-mgiza -mgiza-cpus 4 -sort-buffer-size 8G
>> -sort-compress gzip -sort-parallel 4 -cores 4 -source-syntax"
>>
>> Do I need additional parameters except the ones above? I would appreciate
>> any help.
>>
>> Thanks
>> Burak
>>
>>
>>
>
> _______________________________________________
> Moses-support mailing list
> Moses-support@mit.edu
> http://mailman.mit.edu/mailman/listinfo/moses-support
>
>


--
Hieu Hoang
Research Associate
University of Edinburgh
http://www.hoang.co.uk/hieu
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mailman.mit.edu/mailman/private/moses-support/attachments/20131106/c43ad63c/attachment-0001.htm

------------------------------

Message: 4
Date: Wed, 6 Nov 2013 18:48:41 +0000
From: Hieu Hoang <Hieu.Hoang@ed.ac.uk>
Subject: Re: [Moses-support] alignment info
To: Jorg Tiedemann <tiedeman@gmail.com>
Cc: "moses-support@MIT.EDU" <moses-support@mit.edu>
Message-ID:
<CAEKMkbhqcfxCE_eQ-K29=U6wvJwdpmHAN8chEe9CyO1qBb=Z3w@mail.gmail.com>
Content-Type: text/plain; charset="windows-1252"

you probably need
-report-segmentation



On 5 November 2013 19:38, Jorg Tiedemann <tiedeman@gmail.com> wrote:

> Hi again,
>
> I have another question about the alignment info option. My output is
> different from what I expect according to
> http://www.statmt.org/moses/?n=Moses.AdvancedFeatures#ntoc9
>
> I get only something like
> ?. ||| 0-0 1-1 2-2 3-3
>
> How do I get both phrase and word alignment in my output?
> (like described here:
> http://article.gmane.org/gmane.comp.nlp.moses.user/3776/match=alignment+info
> )
>
> Thanks!
> J?rg
>
>
>
> _______________________________________________
> Moses-support mailing list
> Moses-support@mit.edu
> http://mailman.mit.edu/mailman/listinfo/moses-support
>



--
Hieu Hoang
Research Associate
University of Edinburgh
http://www.hoang.co.uk/hieu
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mailman.mit.edu/mailman/private/moses-support/attachments/20131106/fae9ab5e/attachment.htm

------------------------------

_______________________________________________
Moses-support mailing list
Moses-support@mit.edu
http://mailman.mit.edu/mailman/listinfo/moses-support


End of Moses-support Digest, Vol 85, Issue 20
*********************************************

0 Response to "Moses-support Digest, Vol 85, Issue 20"

Post a Comment