Proceedings of the 11th Joint Conference on Information Sciences (JCIS 2008)

Study on Word Alignment for Reordering of Web-mined OOV Translation Candidates

Authors
Shuang Li1, Meng Sun, Yang Yang, Jianmin Yao
1Yao Jian Min
Corresponding Author
Shuang Li
Available Online December 2008.
DOI
10.2991/jcis.2008.105
Keywords
Web-based Data Mining; Word Alignment; OOV Translation; Natural Language Processing
Abstract

Web information retrieval has attracted widespread attention from researchers, and web-based mining of translations for out-of-vocabulary (OOV) terms has become an active research topic. This paper studies the reordering of OOV translation candidates obtained by web mining. Automatic word alignment is used to compute a weighted score for each candidate, and the candidates are then sorted by score so that the most nearly correct candidate ranks first. The approach performs well when candidates have equal or low frequency. We use OOV phrases from different fields as test corpora for web mining and evaluate the method on them; the results are encouraging.
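The abstract does not specify how the alignment-based score is weighted, so the following is only a minimal sketch of the general idea under stated assumptions: each candidate is scored by averaging, over the source words, the best lexical alignment probability found in the candidate, and candidates are then sorted by that score. The lexical table `lex`, the functions `alignment_score` and `rerank`, and the toy data are all hypothetical and are not taken from the paper.

```python
from typing import Dict, List, Tuple

# Hypothetical lexical table: (source_word, target_word) -> alignment probability.
LexTable = Dict[Tuple[str, str], float]


def alignment_score(source: List[str], candidate: List[str], lex: LexTable) -> float:
    """Average, over source words, of the best alignment probability in the candidate.

    This weighting is an assumption made for illustration only.
    """
    if not source or not candidate:
        return 0.0
    total = 0.0
    for s in source:
        # Align each source word to its most probable word in the candidate.
        total += max(lex.get((s, t), 0.0) for t in candidate)
    return total / len(source)


def rerank(source: List[str], candidates: List[List[str]], lex: LexTable) -> List[List[str]]:
    """Sort candidates by alignment score, highest first.

    Python's sort is stable, so candidates with equal scores keep their
    original order (for example, the order produced by web mining).
    """
    return sorted(candidates, key=lambda c: alignment_score(source, c, lex), reverse=True)


# Toy data (assumed): an English OOV phrase and three mined Chinese candidates.
lex: LexTable = {
    ("machine", "机器"): 0.9,
    ("translation", "翻译"): 0.8,
    ("machine", "学习"): 0.05,
}
source = ["machine", "translation"]
candidates = [["机器", "学习"], ["机器", "翻译"], ["翻译"]]
ranked = rerank(source, candidates, lex)
print(ranked[0])
```

In this toy setting the score depends only on alignment evidence, not on how often a candidate was mined, which is one way a correct but infrequent candidate could still rise to the top.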

Copyright
© 2008, the Authors. Published by Atlantis Press.
Open Access
This is an open access article distributed under the CC BY-NC license (http://creativecommons.org/licenses/by-nc/4.0/).

Volume Title
Proceedings of the 11th Joint Conference on Information Sciences (JCIS 2008)
Series
Advances in Intelligent Systems Research
Publication Date
December 2008
ISSN
1951-6851

Cite this article

TY  - CONF
AU  - Shuang Li
AU  - Meng Sun
AU  - Yang Yang
AU  - Jianmin Yao
PY  - 2008/12
DA  - 2008/12
TI  - Study on Word Alignment for Reordering of Web-mined OOV Translation Candidates
BT  - Proceedings of the 11th Joint Conference on Information Sciences (JCIS 2008)
PB  - Atlantis Press
SP  - 621
EP  - 626
SN  - 1951-6851
UR  - https://doi.org/10.2991/jcis.2008.105
DO  - 10.2991/jcis.2008.105
ID  - Li2008/12
ER  -