ABSTRACT
This paper describes a method for finding phrase-level translation patterns in parallel corpora by applying dependency structure analysis. We use statistical dependency parsers to determine the dependency relations between base phrases in a sentence. We tested the method on a business expression corpus of 10,000 English-Japanese sentence pairs, where it achieved approximately 90% accuracy in extracting bilingual correspondences. The results show that using dependency relations helps to acquire interesting translation patterns.
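The general idea of pairing dependency-linked phrases across languages can be illustrated with a small sketch. It is not the paper's implementation: the abstract does not say how candidate correspondences are scored, so the Dice coefficient used here is an assumption, as are the function names `candidate_units` and `dice_scores`, the `min_cooc` threshold, and the toy romanized corpus. Parser output is assumed to be available already as base phrases plus (dependent, head) index pairs.

```python
from collections import Counter
from itertools import product


def candidate_units(phrases, edges):
    """Return candidate translation units for one sentence.

    phrases: list of base phrases (strings).
    edges: list of (dependent_index, head_index) pairs between base phrases.
    Units are single phrases plus (dependent, head) phrase pairs joined by an edge.
    """
    units = {(p,) for p in phrases}
    units |= {(phrases[d], phrases[h]) for d, h in edges}
    return units


def dice_scores(corpus, min_cooc=2):
    """Score cross-lingual unit pairs by Dice coefficient (an assumed measure)."""
    src_freq, tgt_freq, cooc = Counter(), Counter(), Counter()
    for (src_phrases, src_edges), (tgt_phrases, tgt_edges) in corpus:
        src_units = candidate_units(src_phrases, src_edges)
        tgt_units = candidate_units(tgt_phrases, tgt_edges)
        # Each unit is counted once per sentence because units are sets.
        src_freq.update(src_units)
        tgt_freq.update(tgt_units)
        cooc.update(product(src_units, tgt_units))
    return {
        pair: 2 * n / (src_freq[pair[0]] + tgt_freq[pair[1]])
        for pair, n in cooc.items()
        if n >= min_cooc  # drop pairs seen together too rarely to trust
    }


# Toy, hand-parsed sentence pairs (hypothetical data, not from the paper's corpus).
corpus = [
    ((["thank you", "for your letter"], [(1, 0)]),
     (["o-tegami", "arigatou gozaimasu"], [(0, 1)])),
    ((["thank you", "for your order"], [(1, 0)]),
     (["go-chuumon", "arigatou gozaimasu"], [(0, 1)])),
    ((["we", "received", "your order"], [(0, 1), (2, 1)]),
     (["go-chuumon", "uketorimashita"], [(0, 1)])),
]

scores = dice_scores(corpus)
best_pair = max(scores, key=scores.get)
```

On this toy input the only pair that co-occurs at least twice is "thank you" with "arigatou gozaimasu". Including head-dependent pairs as units lets multi-phrase patterns compete with single phrases under the same score, which is one plausible way dependency relations could constrain the candidate space.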
- Acquisition of phrase-level bilingual correspondence using dependency structure