Moses-support Digest, Vol 125, Issue 15

Send Moses-support mailing list submissions to
moses-support@mit.edu

To subscribe or unsubscribe via the World Wide Web, visit
http://mailman.mit.edu/mailman/listinfo/moses-support
or, via email, send a message with subject or body 'help' to
moses-support-request@mit.edu

You can reach the person managing the list at
moses-support-owner@mit.edu

When replying, please edit your Subject line so it is more specific
than "Re: Contents of Moses-support digest..."


Today's Topics:

1. Second Call - EAMT 2017 Workshop on Social Media and User
Generated Content Machine Translation (Social MT 2017) (Haithem AFLI)
2. Re: mosesserver output empty string for hie models (Guchun Zhang)


----------------------------------------------------------------------

Message: 1
Date: Wed, 8 Mar 2017 17:26:46 +0000
From: Haithem AFLI <aflihaithem@gmail.com>
Subject: [Moses-support] Second Call - EAMT 2017 Workshop on Social
Media and User Generated Content Machine Translation (Social MT 2017)
To: moses-support@mit.edu, mt-list@eamt.org, corpora@uib.no,
ln@cines.fr, echos@ens.fr, wmt-tasks@googlegroups.com
Cc: Haithem Afli <haithem.afli@adaptcentre.ie>
Message-ID:
<CALsfB6-SZCMfPRZYkVg7vYxfB=-OAeBwBqWA_gGoTqMeprsKsQ@mail.gmail.com>
Content-Type: text/plain; charset="utf-8"

SECOND CALL FOR PAPERS
Apologies for multiple postings.

The first Workshop on Social Media and User Generated Content Machine
Translation (Social MT 2017) Co-located with EAMT 2017, Prague, Czech
Republic
For more information please visit:
https://sites.google.com/view/socialmt

CALL FOR PAPERS

With the widespread adoption of social media and online forums, individual
users have been able to actively participate in the generation of online
content in different languages and dialects. As a result, user-generated
content (UGC) has seen an enormous growth in the recent years. The nature
of UGC means that it can be generated at any time and in non-standard
language or formats. Compared to professionally edited text, it is often
more noisy, and likely to take some liberty with commonly established
grammar, punctuation and spelling norms. All this can make it difficult to
translate but UGC can also be incredibly valuable. This workshop will
explore the multifarious aspects of effective MT of data extracted from
social media.
The workshop aims to provide a research platform dedicated to new method
and techniques on translating user-generated content and exploring the use
of such transition on social media analytics. The workshop will solicit
original research contributions related to the theme, which includes (but
is not limited to):

- Models and Tools Development for Social MT
- Machine translation on Microblogs
- Multi-lingual social analytics
- Neural MT for UGC translation
- Multilingual crowdsourcing
- Building resources for UGC translation
- Sentiment translation of UGC
- Analyzing the diffusion of multilingual information
- Using MT for monitoring emergency responses among social crowds
- Multilingual Social-based web platform for disaster management
- Multilingual and language-specific Information Retrieval on Social Web
- Crosslingual document alignment using UGC data
- Named entity transliteration on social media content
- Code-mixed UGC translation
- MT for Big social data analysis

Submissions may include work in progress as well as finished work.
Submissions must have a clear focus on specific issues pertaining to UGC
and its translation. Descriptions of commercial systems are welcome, but
authors should be willing to discuss the details of their work.

IMPORTANT DATES

January 30, 2017: First Call for Workshop Papers
March 8, 2017: Second Call for Workshop Papers
March 24, 2017: Workshop Paper Due Date
April 14, 2017: Notification of Acceptance
May 12, 2017: Camera-ready papers due
May 31, 2017: Workshop Date (half-day workshop)

SUBMISSION FORMAT

Submissions must conform to the official style guidelines for EAMT 2017 (
https://ufal.mff.cuni.cz/pbml/instructions-authors).
Overleaf Project to Clone: https://www.overleaf.com/read/jgvrjmrqnwct
MS Word Template:
https://ufal.mff.cuni.cz/eamt2017/files/templates/eamt17.doc

Contributions can be short or long papers. Short paper submission must
describe original and unpublished work without exceeding eight (8) pages plus
any number of pages for references. Characteristics of short papers
include: a small, focused contribution; work in progress; a negative
result; an opinion piece; an interesting application nugget. Long paper
submissions must describe substantial, original, completed and unpublished
work without exceeding twelve (12) pages plus any number of pages for
references.
Reviewing will be double-blind, so the papers should not reveal the
authors? identity. Accepted papers will be published in the workshop
proceedings.
Double submission policy: Parallel submission to other meetings or
publications is possible but must be immediately notified to the workshop
organizers.
Submission Website: https://easychair.org/conferences/?conf=socialmt2017

Extended versions of the best papers will be published into an upcoming
special issue of ?Translating User Generated Content? on Machine
Translation Journal

INVITED SPEAKER:

Houda Bouamor (Carnegie Mellon University, Qatar)

WORKSHOP ORGANIZERS

General Chair: Andy Way (ADAPT Centre, Dublin City University)

Program Chair :Haithem Afli (ADAPT Centre, Dublin City University)

Program Committee
Lo?c Barrault (LIUM, Le Mans University)
Laurent Besacier (LIG, Grenoble University)
Philipp Koehn (University of Edinburgh / Johns Hopkins University)
Abdelkarim Mars (Grenoble University)
Matteo Negri (FBK)
Houda Bouamor (CMU Qatar)
Yvette Graham (ADAPT Centre, Dublin City University)
Dimitar Shterionov (KantanMT)
Marco Turchi (FBK)
Antonio Toral (University of Groningen)
Lucia Specia (University of Sheffield)
Kashif Shah (eBay)
Rejwanul Haque (Lingo24)
Barry Haddow (University of Edinburgh)
Jinhua Du (ADAPT Centre, Dublin City University)
Daniel Stein (eBay)
Mohammed Hasanuzzaman (ADAPT Centre, Dublin City University)
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mailman.mit.edu/mailman/private/moses-support/attachments/20170308/91696453/attachment-0001.html

------------------------------

Message: 2
Date: Thu, 9 Mar 2017 12:30:10 +0000
From: Guchun Zhang <gzhang@alphacrc.com>
Subject: Re: [Moses-support] mosesserver output empty string for hie
models
To: Hieu Hoang <hieuhoang@gmail.com>
Cc: moses-support <moses-support@mit.edu>
Message-ID:
<CA+cfSVLgDB54YhM0U1QdvFCi4_4_cCSRoB-LKmmTQF=DvnPWCg@mail.gmail.com>
Content-Type: text/plain; charset="utf-8"

Hi Hieu,

It works now. I didn't manage to make binarize4moses2.perl work as
sigtest-filter didn't compile and SALM page seems down.

Thanks,
Guchun

On 8 March 2017 at 16:01, Guchun Zhang <gzhang@alphacrc.com> wrote:

> Ah, cool. Will let you know how it goes in a few days time.
>
> Thanks a lot,
> Guchun
>
> On 8 March 2017 at 14:49, Hieu Hoang <hieuhoang@gmail.com> wrote:
>
>> cool, nearly there.
>>
>> moses2 only supports ProbingPT, not PhraseDictionaryOnDisk
>> http://www.statmt.org/moses/?n=Site.Moses2
>> use the script
>> scripts/generic/binarize4moses2.perl
>> to binarize. If you are using hiero models, don't forget the flag --scfg
>>
>> * Looking for MT/NLP opportunities *
>> Hieu Hoang
>> http://moses-smt.org/
>>
>>
>> On 8 March 2017 at 14:43, Guchun Zhang <gzhang@alphacrc.com> wrote:
>>
>>> Great. The fix works. :-)
>>>
>>> As with many things, now there's this run time error:
>>>
>>> Feature name PhraseDictionaryOnDisk is not registered.Aborted (core
>>> dumped)
>>>
>>> My hie models are binarised. Should I skip binarisation for such models
>>> from now on?
>>>
>>> Thanks,
>>> Guchun
>>>
>>> On 8 March 2017 at 14:02, Hieu Hoang <hieuhoang@gmail.com> wrote:
>>>
>>>> git pull and try again. I've just pushed a fix
>>>> https://github.com/moses-smt/mosesdecoder/commit/ac3069fb32d
>>>> 87c57ac84d5bac3d07328b5ab71dd
>>>>
>>>> * Looking for MT/NLP opportunities *
>>>> Hieu Hoang
>>>> http://moses-smt.org/
>>>>
>>>>
>>>> On 8 March 2017 at 12:48, Guchun Zhang <gzhang@alphacrc.com> wrote:
>>>>
>>>>> Sure. Both are attached. Thanks, Guchun
>>>>>
>>>>> On 8 March 2017 at 11:52, Hieu Hoang <hieuhoang@gmail.com> wrote:
>>>>>
>>>>>> what's the exact bjam command you used, and can I please have a look
>>>>>> at the compile error
>>>>>>
>>>>>> * Looking for MT/NLP opportunities *
>>>>>> Hieu Hoang
>>>>>> http://moses-smt.org/
>>>>>>
>>>>>>
>>>>>> On 8 March 2017 at 11:48, Guchun Zhang <gzhang@alphacrc.com> wrote:
>>>>>>
>>>>>>> Have to say ?I can't disagree with that.?
>>>>>>>
>>>>>>> Moses2 still doesn't compile when --max-kenlm-order is passed.
>>>>>>>
>>>>>>> HTH.
>>>>>>>
>>>>>>> Guchun
>>>>>>>
>>>>>>> On 8 March 2017 at 10:29, Hieu Hoang <hieuhoang@gmail.com> wrote:
>>>>>>>
>>>>>>>> I guess the cost/benefit ratio isn't high enough for commercial
>>>>>>>> users using the server, but you can probably answer that better. Hopefully
>>>>>>>> the faster Moses2 implementation can improve that.
>>>>>>>>
>>>>>>>> If it doesn't compile, please let me know
>>>>>>>>
>>>>>>>> * Looking for MT/NLP opportunities *
>>>>>>>> Hieu Hoang
>>>>>>>> http://moses-smt.org/
>>>>>>>>
>>>>>>>>
>>>>>>>> On 8 March 2017 at 09:31, Guchun Zhang <gzhang@alphacrc.com> wrote:
>>>>>>>>
>>>>>>>>> Also, may I ask why hie models aren't commonly used in the server
>>>>>>>>> mode?
>>>>>>>>>
>>>>>>>>> On 8 March 2017 at 09:28, Guchun Zhang <gzhang@alphacrc.com>
>>>>>>>>> wrote:
>>>>>>>>>
>>>>>>>>>> ?Thanks, Hieu. I will give Moses2 another try. Last time, it
>>>>>>>>>> didn't compile.?
>>>>>>>>>>
>>>>>>>>>> On 7 March 2017 at 18:35, Hieu Hoang <hieuhoang@gmail.com> wrote:
>>>>>>>>>>
>>>>>>>>>>> if you're not able to fix it, you might wanna try the hiero
>>>>>>>>>>> model on Moses2
>>>>>>>>>>> http://www.statmt.org/moses/?n=Site.Moses2
>>>>>>>>>>> I don't think the hiero model is used much in server mode so
>>>>>>>>>>> it's suffering from code rot.
>>>>>>>>>>>
>>>>>>>>>>> The model is much faster in Moses2 so hopefully more people will
>>>>>>>>>>> use it. The server code is also much simplier
>>>>>>>>>>>
>>>>>>>>>>>
>>>>>>>>>>> * Looking for MT/NLP opportunities *
>>>>>>>>>>> Hieu Hoang
>>>>>>>>>>> http://moses-smt.org/
>>>>>>>>>>>
>>>>>>>>>>>
>>>>>>>>>>> On 7 March 2017 at 17:46, Guchun Zhang <gzhang@alphacrc.com>
>>>>>>>>>>> wrote:
>>>>>>>>>>>
>>>>>>>>>>>> Hi,
>>>>>>>>>>>>
>>>>>>>>>>>> I pull the repo recently and just noticed that mosesserver
>>>>>>>>>>>> outputs empty strings for hie models. The command I issued is
>>>>>>>>>>>>
>>>>>>>>>>>> *$ mosesserver --config moses.tuned.ini*
>>>>>>>>>>>>
>>>>>>>>>>>> There's no error message at all. Tried to record the server log
>>>>>>>>>>>> and this is the log:
>>>>>>>>>>>>
>>>>>>>>>>>> *127.0.0.1:43644 <http://127.0.0.1:43644> - no_user -
>>>>>>>>>>>> [07/Mar/2017:17:37:48 +0000] "POST" 200 581*
>>>>>>>>>>>>
>>>>>>>>>>>> Is "no_user" causing the problem? The same ini runs fine with
>>>>>>>>>>>> moses.
>>>>>>>>>>>>
>>>>>>>>>>>> It's been a while (more than a year at least) since last time I
>>>>>>>>>>>> looked at the source code and it seems the code has changed a lot. Where
>>>>>>>>>>>> should I start to debug it?
>>>>>>>>>>>>
>>>>>>>>>>>> Thanks,
>>>>>>>>>>>> Guchun
>>>>>>>>>>>>
>>>>>>>>>>>> _______________________________________________
>>>>>>>>>>>> Moses-support mailing list
>>>>>>>>>>>> Moses-support@mit.edu
>>>>>>>>>>>> http://mailman.mit.edu/mailman/listinfo/moses-support
>>>>>>>>>>>>
>>>>>>>>>>>>
>>>>>>>>>>>
>>>>>>>>>>
>>>>>>>>>>
>>>>>>>>>> --
>>>>>>>>>> *Guchun Zhang*
>>>>>>>>>> Machine Translation Lead
>>>>>>>>>> ? <http://www.thisisalpha.com/>
>>>>>>>>>> Office: +44(0)1223 431035 <+44%201223%20431035> | Skype:
>>>>>>>>>> mt.alpha | *thisisalpha.com* <http://www.thisisalpha.com/>
>>>>>>>>>> St Andrews House, St Andrew Road | Cambridge CB4 1DL | United
>>>>>>>>>> Kingdom
>>>>>>>>>> [image: ALPHA]
>>>>>>>>>>
>>>>>>>>>
>>>>>>>>>
>>>>>>>>>
>>>>>>>>> --
>>>>>>>>> *Guchun Zhang*
>>>>>>>>> Machine Translation Lead
>>>>>>>>> ? <http://www.thisisalpha.com/>
>>>>>>>>> Office: +44(0)1223 431035 <01223%20431035> | Skype: mt.alpha |
>>>>>>>>> *thisisalpha.com* <http://www.thisisalpha.com/>
>>>>>>>>> St Andrews House, St Andrew Road | Cambridge CB4 1DL | United
>>>>>>>>> Kingdom
>>>>>>>>> [image: ALPHA]
>>>>>>>>>
>>>>>>>>
>>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>> --
>>>>>>> *Guchun Zhang*
>>>>>>> Machine Translation Lead
>>>>>>> ? <http://www.thisisalpha.com/>
>>>>>>> Office: +44(0)1223 431035 <01223%20431035> | Skype: mt.alpha |
>>>>>>> *thisisalpha.com* <http://www.thisisalpha.com/>
>>>>>>> St Andrews House, St Andrew Road | Cambridge CB4 1DL | United Kingdom
>>>>>>> [image: ALPHA]
>>>>>>>
>>>>>>
>>>>>>
>>>>>
>>>>>
>>>>> --
>>>>> *Guchun Zhang*
>>>>> Machine Translation Lead
>>>>> ? <http://www.thisisalpha.com/>
>>>>> Office: +44(0)1223 431035 <01223%20431035> | Skype: mt.alpha |
>>>>> *thisisalpha.com* <http://www.thisisalpha.com/>
>>>>> St Andrews House, St Andrew Road | Cambridge CB4 1DL | United Kingdom
>>>>> [image: ALPHA]
>>>>>
>>>>
>>>>
>>>
>>>
>>> --
>>> *Guchun Zhang*
>>> Machine Translation Lead
>>> ? <http://www.thisisalpha.com/>
>>> Office: +44(0)1223 431035 <01223%20431035> | Skype: mt.alpha |
>>> *thisisalpha.com* <http://www.thisisalpha.com/>
>>> St Andrews House, St Andrew Road | Cambridge CB4 1DL | United Kingdom
>>> [image: ALPHA]
>>>
>>
>>
>
>
> --
> *Guchun Zhang*
> Machine Translation Lead
> ? <http://www.thisisalpha.com/>
> Office: +44(0)1223 431035 <+44%201223%20431035> | Skype: mt.alpha |
> *thisisalpha.com* <http://www.thisisalpha.com/>
> St Andrews House, St Andrew Road | Cambridge CB4 1DL | United Kingdom
> [image: ALPHA]
>



--
*Guchun Zhang*
Machine Translation Lead
? <http://www.thisisalpha.com/>
Office: +44(0)1223 431035 | Skype: mt.alpha | *thisisalpha.com*
<http://www.thisisalpha.com/>
St Andrews House, St Andrew Road | Cambridge CB4 1DL | United Kingdom
[image: ALPHA]
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mailman.mit.edu/mailman/private/moses-support/attachments/20170309/17b986d8/attachment.html
-------------- next part --------------
A non-text attachment was scrubbed...
Name: image002.jpg
Type: image/jpeg
Size: 67617 bytes
Desc: not available
Url : http://mailman.mit.edu/mailman/private/moses-support/attachments/20170309/17b986d8/attachment.jpg

------------------------------

_______________________________________________
Moses-support mailing list
Moses-support@mit.edu
http://mailman.mit.edu/mailman/listinfo/moses-support


End of Moses-support Digest, Vol 125, Issue 15
**********************************************

0 Response to "Moses-support Digest, Vol 125, Issue 15"

Post a Comment