Full Text:   <2481>

CLC number: TP391

On-line Access: 2013-03-05

Received: 2012-03-11

Revision Accepted: 2013-01-13

Crosschecked: 2013-02-22

Cited: 2

Clicked: 5722

Citations:  Bibtex RefMan EndNote GB/T7714

-   Go to

Article info.
Open peer comments

Journal of Zhejiang University SCIENCE C 2013 Vol.14 No.3 P.179-196

http://doi.org/10.1631/jzus.C1200061


Punjabi DeConverter for generating Punjabi from Universal Networking Language


Author(s):  Parteek Kumar, Rajendra Kumar Sharma

Affiliation(s):  Department of Computer Science & Engineering, Thapar University, Patiala 147004, India; more

Corresponding email(s):   parteek.bhatia, rksharma@thapar.edu

Key Words:  DeConverter, EnConverter, Machine translation, Universal Networking Language (UNL), Syntactic linearization


Parteek Kumar, Rajendra Kumar Sharma. Punjabi DeConverter for generating Punjabi from Universal Networking Language[J]. Journal of Zhejiang University Science C, 2013, 14(3): 179-196.

@article{title="Punjabi DeConverter for generating Punjabi from Universal Networking Language",
author="Parteek Kumar, Rajendra Kumar Sharma",
journal="Journal of Zhejiang University Science C",
volume="14",
number="3",
pages="179-196",
year="2013",
publisher="Zhejiang University Press & Springer",
doi="10.1631/jzus.C1200061"
}

%0 Journal Article
%T Punjabi DeConverter for generating Punjabi from Universal Networking Language
%A Parteek Kumar
%A Rajendra Kumar Sharma
%J Journal of Zhejiang University SCIENCE C
%V 14
%N 3
%P 179-196
%@ 1869-1951
%D 2013
%I Zhejiang University Press & Springer
%DOI 10.1631/jzus.C1200061

TY - JOUR
T1 - Punjabi DeConverter for generating Punjabi from Universal Networking Language
A1 - Parteek Kumar
A1 - Rajendra Kumar Sharma
J0 - Journal of Zhejiang University Science C
VL - 14
IS - 3
SP - 179
EP - 196
%@ 1869-1951
Y1 - 2013
PB - Zhejiang University Press & Springer
ER -
DOI - 10.1631/jzus.C1200061


Abstract: 
deConverter is core software in a universal Networking Language (UNL) system. A UNL system has enConverter and deConverter as its two major components. enConverter is used to convert a natural language sentence into an equivalent UNL expression, and deConverter is used to generate a natural language sentence from an input UNL expression. This paper presents design and development of a Punjabi deConverter. It describes five phases of the proposed Punjabi deConverter, i.e., UNL parser, lexeme selection, morphology generation, function word insertion, and syntactic linearization. This paper also illustrates all these phases of the Punjabi deConverter with a special focus on syntactic linearization issues of the Punjabi deConverter. syntactic linearization is the process of defining arrangements of words in generated output. The algorithms and pseudocodes for implementation of syntactic linearization of a simple UNL graph, a UNL graph with scope nodes and a node having un-traversed parents or multiple parents in a UNL graph have been discussed in this paper. Special cases of syntactic linearization with respect to Punjabi language for UNL relations like ‘and’, ‘or’, ‘fmt’, ‘cnt’, and ‘seq’ have also been presented in this paper. This paper also provides implementation results of the proposed Punjabi deConverter. The deConverter has been tested on 1000 UNL expressions by considering a Spanish UNL language server and agricultural domain threads developed by Indian Institute of Technology (IIT), Bombay, India, as gold-standards. The proposed system generates 89.0% grammatically correct sentences, 92.0% faithful sentences to the original sentences, and has a fluency score of 3.61 and an adequacy score of 3.70 on a 4-point scale. The system is also able to achieve a bilingual evaluation understudy (BLEU) score of 0.72.

Darkslateblue:Affiliate; Royal Blue:Author; Turquoise:Article

Reference

[1]Bhattacharyya, P., 2001. Multilingual Information Processing Through Universal Networking Language. Indo UK Workshop on Language Engineering for South Asian Languages, p.1-10.

[2]Blanc, É., 2005. About and around the French EnConverter and the French DeConverter. Res. Comput. Sci., 12:157-166.

[3]Boguslavsky, I., Frid, N., Iomdin, L., Kreidlin, L., Sagalova, I., Sizov, V., 2000. Creating a Universal Networking Language Module within an Advanced NLP System. 18th Int. Conf. on Computational Linguistics, p.83-89.

[4]Boguslavsky, I., Cardeñosa, J., Gallardo, C., Iraola, L., 2005. The UNL initiative: an overview. LNCS, 3406:377-387.

[5]Daoud, M., 2005. Arabic generation in the framework of the Universal Networking Language. Res. Comput. Sci., 12: 195-209.

[6]Dave, S., Parikh, J., Bhattacharyya, P., 2001. Interlingua based English Hindi machine translation and language divergence. J. Mach. Transl., 16(4):251-304.

[7]Dey, K., Bhattacharyya, P., 2005. Universal Networking Language based analysis and generation of Bengali case structure constructs. Res. Comput. Sci., 12:215-229.

[8]Dhanabalan, T., Geetha, V., 2003. UNL DeConverter for Tamil. Int. Conf. on the Convergence of Knowledge, Culture, Language and Information Technologies, p.1-6.

[9]Hrushikesh, B., 2002. Towards Marathi Sentence Generation from Universal Networking Language. MT Thesis, Indian Institute of Technology, Bombay, Mumbai.

[10]Keshari, B., Bista, K., 2005. UNL Nepali DeConverter. 3rd Int. Conf. on CALIBER, p.70-76.

[11]Lewis, M.P., 2009. Ethnologue: Languages of the World (16th Ed.). SIL International, Dallas.

[12]Linguistic Data Consortium (LDC), 2005. Linguistic Data Annotation Specification: Assessment of Adequacy and Fluency in Translations. Revision 1.5, Technical Report.

[13]Martins, T., Rino, M., Osvaldo, N., Hasegawa, R., Nunes, V., 1997. Specification of the UNL-Portuguese EnConverter-DeConverter Prototype. Relatórios Técnicos do ICMC-USP, p.1-10.

[14]Nalawade, A., 2007. Natural Language Generation from Universal Networking Language. MT Thesis, Indian Institute of Technology, Bombay, Mumbai.

[15]Pelizzoni, J., Nunes, M., 2005. Flexibility, configurability and optimality in UNL DeConversion via multiparadigm programming. Res. Comput. Sci., 12:175-194.

[16]Raman, S., Alwar, N., 1990. An AI-based approach to machine translation in Indian languages. Commun. ACM, 33(5):521-527.

[17]Shi, X., Chen, Y., 2005. A UNL DeConverter for Chinese Universal Network Language. Res. Comput. Sci., 12: 167-174.

[18]Singh, S., Dalal, M., Vachhani, V., Bhattacharyya, P., Damani, O.P., 2007. Hindi Generation from Interlingua. 17th MT Summit, Copenhagen, Denmark, p.1-8.

[19]Sinha, R., 2005. Hindi Generation: Syntax Planning and Case Marking. Mini Project Report, Indian Institute of Technology, Bombay, Mumbai.

[20]Uchida, H., 2005. Universal Networking Language (UNL): Specifications Version 2005, UNDL Foundation.

[21]Vachhani, V., 2006. UNL to Hindi DeConverter. BE Thesis, Dharamsinh Desai Institute of Technology, Nadiad.

[22]Vora, A., 2002. Generation of Hindi sentences from Universal Networking Language. BE Thesis, Dharamsinh Desai Institute of Technology, Nadiad.

Open peer comments: Debate/Discuss/Question/Opinion

<1>

Please provide your name, email address and a comment





Journal of Zhejiang University-SCIENCE, 38 Zheda Road, Hangzhou 310027, China
Tel: +86-571-87952783; E-mail: cjzhang@zju.edu.cn
Copyright © 2000 - 2024 Journal of Zhejiang University-SCIENCE