22

8. CONCLUSION

This research work is concerned with the problem of developing a framework for Telugu cross language information retrieval. In particular, this research work is concerned with the problem of making use of bilingual ontologies and language grammar rules for Telugu information retrieval.

The problem we faced in the evaluation of CLIR using different approaches was that the retrieval performance may be affected by several factors: stemming, term segmentation, and retrieval models, for example. The results suggest that the language grammar based model led to much better retrieval performance than traditional methods.

The principal objective attained in this research work, as shown by our methodology and the results of our experiments, was the approach to cross language information retrieval for Telugu using the ontology and language grammar rules for query and content conversion.

The research work presented in this thesis has developed a new grammar rule based technique to process the user given queries. It leads to an improvement in CLIR effectiveness and can also be used to improve in retrieving of relevant information for given Telugu query.

In this research, we provided new ways to acquire linguistic resources using multilingual content on the web. These linguistic resources not only improve the efficiency and effectiveness of Telugu English cross-language Information retrieval but also have wider applications than CLIR. The focus for the future will be on designing strategies that can convert the full content in the retrieved results.

We evaluated the user acceptance of retrieval performance attained under using the rule based cross language information retrieval for Telugu using technology acceptance model.

Limitations and Future Work

The main focus of the work presented in this thesis was the investigation of our hypothesis for rule based cross language information retrieval for Telugu, namely that a CLIR for Telugu can perform better using bilingual ontologies and language grammar rules to convert user queries and content retrieved for the user given query than using classic dictionary translation approaches.

Content conversion is another issue. There is no gold standard or complete set of content in Telugu language, which implies that there is a need for content conversion mechanism for Telugu cross language information retrieval.

There is also a series of research aspects related to CLIR requiring further investigation, such as domain knowledge acquisition, complete conversion of the content represented by the snippets and the adaptation of the algorithm for mobile devices.


BIBLIOGRAPHY

1.  Borodin, Y., Mahmud, J., Ramakrishnan, I. V. “Context browsing with mobiles-when less is more,” in Proc. Mobisys, 2007.

2.  Bruce, H, User satisfaction with information seeking on the Internet, Journal of the American Society for Information Science, Vol.49, No.6, 1998, pp.541-556.

3.  Carol Peters, Martin Braschler, Paul Clough, “Cross-Language Information Retrieval”, Springer Berlin Heidelberg, 2012, pp. 57-84

4.  Carpineto, C., Pietra, A., Mizzaro, S. and Romano, G. “Mobile information Retrieval”, Lecture Notes in Computer Science, Vol. 3936, Advances in Information Retrieval, pp. 155-166, 2006.

5.  Carpineto. C., Romano. G., Snidero. M “Mobile information Retrieval with Search Results Clustering: Prototypes and Evaluation”, In Journal of the American Society for Information Science and Technology, Volume 60, Issue 5, pages 877-895, May 2009.

6.  Castellano, G., Mesto, F., Minunno, M. and Torsello, A. “Web user profiling using fuzzy clustering : In Applications of Fuzzy Sets Theory,” in Proc. WILF, 2007 .

7.  Church, K., Smyth, B., Bradley, K. and Cotter, P. “A large scale study of European mobile search behavior,” in Proc. MobileHCI’08, 2008.

8.  Douglas W. Oard, Daqing He, Jianqiang Wang , “User-assisted query translation for interactive cross-language information retrieval”, Information Processing & Management, Vol.44, No.1, pp. 181-211, January 2008.

9.  E. Ngai, J. Poon, and Y. Chan, Empirical examination of the adoption of WebCT using TAM, Journal of Computers & Education, vol. 48, 2007, pp.250-267.

10. G. Madhavi, M. Balakrishnan, and N. Balakrishnan Reddy “Om: One tool for many (Indian) languages”, Journal of Zhejiang University Science, Vol.,6, No.,11, pp. 1348-1353, 2005.

11. Gatian,A.W., Is user satisfaction a valid measure of system effectiveness? Information and Management, Vol.26, No.3, 1994, pp119-131.

12. Gérald Kembellec, Imad Saleh, Catherine Sauvaget , “OntologyNavigator: WEB 2.0 scalable ontology based CLIR portal to IT scientific corpus for researchers”, International Journal of Design Sciences and Technology 16, 2 (2009).

13. Gluck, M., Exploring the relationship between user satisfaction and relevance in information systems, Information Processing and Management, Vol.32 No.1, 1996, pp.89-104.

14. Goenka, K., Arpinar, I. B. and Mustafa, N. “Mobile Web Search Personalization using Ontological User profile,” in Proc. ACM SE’10, 2010.

15. Griffiths, J., Johnson, F. & Hartley, R., User satisfaction as a measure of system performance, Journal of Librarianship and Science, Vol.39, No.3, 2007, pp.142-152.

16. Homa. B, Hashemi and Shakery. A “Mining a Persian–English comparable corpus for cross-language information retrieval”, Information Processing & Management, Vol.50, No. 2, pp.384-398, March 2014.

17. Huffman,S.B., and Hochster, M., How well does result relevance predict session satisfaction?, Proceedings of the annual international ACM SIGIR conference on Research and development in information retrieval, 2007, pp.567-574.

18. Huo. Z., Zhao. J., Hu. X “Web Data Management for Mobile Users, Network and Parallel Computing Workshops”, In NPC Workshops IFIP International Conference on 18-21 Sept, 2007.

19. A. Ashish, and P. Bhattacharyya “Using Morphology to Improve Marathi Monolingual Information Retrieval” Indian Institute of Technology, Bombay. India. Source http://www.isical.ac.in/~fire/paper/Ashish_almeida-IITB-fire2008.pdf, 2008.

20. A. Menon, S. Saravanan, R. Loganathan and K. Soman “Amrita Morph Analyzer and Generator for Tamil: A Rule-Based Approach”, Tamil Internet Conference, Cologne, Germany, pp. 239-243, 2009.

21. Aitao Chen and Fredric C. Gey “Combining Query Translation and Document Translation in Cross-Language Retrieval”, In Comparative Evaluation of Multilingual Information Access Systems, volume 3237, pages 121–124, Berlin, Heidelberg, 2004.

22. Allan, J., Aslam, J., Belkin, N., Buckley, C., Callan, J., Croft, B. and Dumais, S. “Challenges in Information Retrieval and Language Modeling”, Report of a Workshop held at the Center for Intelligent Information Retrieval, 2002.

23. Anand Kumar M, Dhanalakshmi V, Rajendran S, Soman K P: A Novel Approach to Morphological "Hörsaalgebäude" of the University of Koeln Köln, Universitätsstrasse 35, Albertus-Magnus-Platz 1,Germany, 2009.

24. Arias, M., Cantera, J. M., Vegas, J., Fuente, J., Alonso, J. C., Bernardo, G., Llamas, C. and Zubizarreta, A. “Context-based personalization for mobile web search,” in Proc. PersDB2008, 2008.

25. Banu. W.A., Kader. P.S.A “A Hybrid Context Based Approach for Web Information Retrieval”, In International Journal of Computer Applications, article 5, 2010.

26. Bergstorm, A., Jaksetic, P. and Nordin, P. “Enhancing information retrieval by automatic acquisition of textual relations using genetic programming,” in Proc. IUI’00, 2000.

27. D Mandal, M Gupta, S Dandapat, P Banerjee, and S Sarkar “Bengali and Hindi to English CLIR Evaluation”, Journal of Advances in Multilingual and Multimodal Information Retrieval, Springer Berlin Heidelberg Series, Vol., 5152, ISSN 0302-9743, pp. 95-102, 2008.

28. D.He and D. Wu “Translation enhancement: A new relevance feedback method for cross-language information retrieval”, in CIKM ’08: Proceeding of the 17th ACM conference on information and knowledge management, ACM New York, USA pp. 729-738, 2008.

29. Damjanovic. V., Gasevic. D., and Devedzic. V “Semiotics for Ontologies and Knowledge Representation”, In Proc. of Wissens management, pp.571-574, 2005.

30. Dinesh Mavaluru, R. Shriram and W. Aisha Banu, “Ensemble Approach for Cross Language Information Retrieval”, in 13th International Conference on Intelligent Text Processing and Computational Linguistics (CICLing 2012), IIT-Delhi, New Delhi, Springer, Vol.,2, pp. 274-286, H-Index-8, 2012.

31. J. L. Herlocker, A. J. Konstan, L. G. Terveen, and J.T. Riedl, “Evaluating collaborative filtering recommender systems”. ACM Transactions on Information Systems, Vol. 22, No 1, pp. 5–53, 2004.

32. J. Shen and L.B. Eder, Intentions to Use Virtual Worlds for Education, Journal of Information Systems Education, Vol. 20, 2009, pp. 225-233

33. J.H. Sharp, Development, Extension and Application: A Review of the Technology Acceptance Model, Proceedings of Information Systems Educators Conference, vol. 23, 2006

34. Jan. Z, and Darena. F, “Discovering Opinions from Customers Unstructured Textual Reviews Written in Different Natural Languages”, pp.137-159, 2013

35. K. R. Beesley and L. Karttunen, Finite State Morphology. Stanford: CSLI Publications, 2003.

36. K. Saravanan, R. Udupa, and A. Kumaran, “Cross lingual Information Retrieval System Enhanced with Transliteration Generation and Mining” in Proceedings of Forum for Information Retrieval Evaluation (FIRE-2010), Kolkata, India, 2010.

37. K.C Manoj, R. Sagar, P. Bhattacharyya and P. Damani “Hindi and Marathi to English Cross Language Information Retrieval” at Cross-Language Evaluation. Forum 2007, Springer-Verlag Berlin, Heidelberg, ISBN: 978- 3-540-85759-4, pp 111-118, 2008.

38. Khan. A., and Naveed. A.M “Corpus Based Mapping of Urdu Characters for Cell Phones”, In Proceedings of the Conference on Language & Technology, 2009.

39. Koehn, P. Europarl “A parallel corpus for statistical machine translation”, In MT summit 2005.

40. Kumar Sourabh and Vibhakar Mansotra “Factors Affecting the Performance of Hindi Language searching on web: An Experimental Study”, in International Journal of Scientific & Engineering Research Volume 3, Issue 4, April-2012

41. Lazarinis. F., Jesus. S., and John. V “Current research issues and trends in non-English Web searching”, Springer Science, February 2009.

42. M. Anand and V. Dhanalakshmi “A Novel Data Driven Algorithm for Tamil Morphological Generator”, International Journal of Computer Applications, Vol.,12, no 7, pp. 52–56, 2010.

43. M. Federico, and N. Bertoldi “Statistical cross-language information retrieval using N-best query translations.” in Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval, pp. 167–174. ACM Press, New York, 2002.

44. M. Ganapathiraju and L. Levin "TelMore: Morphological Generator for Telugu Nouns and Verbs”, in Proceedings of Second International Conference on Universal Digital Library, Alexandria, Egypt, pp. 17-19, 2006.

45. S. Kumar and V. Mansotra “An Experimental Analysis on the Influence of English on Hindi Language Information Retrieval” International Journal of Computational Linguistics Research Vol.,2, 2011.

46. S. Liaw, H. Huang, and G. Chen, Surveying instructor and learner attitudes toward e-learning, Journal of Computers & Education, vol.49, 2007, pp.1066-1080

47. S. Saraswathi, M. AsmaSiddhiqaa, K. Kalaimagal and M. Kalaiyarasi “BiLingual Information Retrieval System for English and Tamil”, Journal of Computing, Vol., 2, No., 4, ISSN 2151-9617, pp 85-89, 2010.

48. Saraswathi. S., Siddhiqaa. M., and Kalaimagal. K “Bilingual Information Retrieval System for English and Tamil”, Journal of Computing, 2(4), April 2010.

49. Saurabh Varshney, Jyoti Bajpai , “Improving Performance Of English-Hindi Cross Language Information Retrieval Using Transliteration Of Query Terms”, International Journal on Natural Language Computing (IJNLC), Vol. 2, No.6, December 2013.

50. Shriram, R. Sugumaran, V. and Vivekanandan K. ”A middleware for information processing in mobile computing platforms”, Int. J. Mob. Comm., Vol. 6, No. 5, pp. 646-666, 2008.

51. Sujatha. P and Dhavachelvan, “A Review on the Cross and Multilingual Information Retrieval”, International Journal of Web & Semantic Technology, Vol.2, No.4, pp:115-124, 2011.

52. T. Teo, Modelling technology acceptance in education: A study of pre-service teachers Computers & Education, The Turkish Online Journal of Educational Technology, vol. 11, 2012, pp. 264-272

53. V. Mallamma Reddy and M. Hanumanthappa “Kannada and Telugu Native Languages to English Cross Language Information Retrieval”, International Journal of Computer Science and Information Technologies, Vol., 2, No., 5, pp. 1876-1880, 2011.

54. Vijayanand. K., and Seenivasan. R.P “Named Entity Recognition and Transliteration for Telugu Language”, In Language in India , Special Volume: Problems of Parsing in Indian Languages, May 2011.

55. Wang. X., Broder. A., Gabrilovich. E., Josifovski. V., and Pang. B.: Cross-language query classification using web search for exogenous knowledge. In Proceedings of the Second ACM International Conference on Web Search and Data Mining, February 2009.

56. Machado, D., Barbosa, T., Pais, S., Martins, B. and Dias., G., “Universal Mobile Information Retrieval,” in Proc. UAHCI’09, 2009.

57. Maeda. A., and Kimura. F “An Approach to Cross-Age and Cross-Cultural Information Access for Digital Humanities”, In Digital Resources for the Humanities and Arts 2008 Conference (DRHA08), Cambridge, U.K., 2008.

58. Mallamma V Reddy, M. Hanumanthappa “Kannada and Telugu Native Languages to English Cross Language Information Retrieval” Department of Computer Science and Applications, Bangalore University, Bangalore, INDIA. (IJCSIT) International Journal of Computer Science and Information Technologies, Vol. 2 (5) , 2011.

59. Manaal F, Prasenjit M and Sebastian P “Soundex-based Translation correction in Urdu–English Cross-Language Information Retrieval”, Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, November 8-12, pp. 25-29, 2011.

60. Manish Shrivastava, “Morphology Based Natural Language Processing tools for Indian Languages,” Department of Computer Science and Engineering, Indian Institute of Technology, Powai, Mumbai, 2005.

61. Manning, C.D and Schutze, H “Foundations of Statistical Natural Language Processing”, The MIT Press, 2001.

62. Manuela Yapomo, Gloria Corpas, Ruslan Mitkov, “CLIR- and ontology-based approach for bilingual extraction of comparable documents”, PP.121-125.

63. Matthijs, N. and Radlinski, F., “Personalizing Web Search using Long Term Browsing History,” in Proc. WSDM’11, 2011.

64. Monti. J., Monteleone. M., di Buono. M.P., Marano. F.,”Natural Language Processing and Big Data - An Ontology-Based Approach for Cross-Lingual Information Retrieval”, Social Computing (SocialCom), 2013, PP.725 – 731.

65. Nasharuddin. N.A., and Abdullah. M “Cross-lingual Information Retrieval: State-of-the-Art”, In electronic Journal of Computer Science and Information Technology. Vol 2, 2010.

66. Nguyen. D et al, “WikiTranslate, Query Translation for Cross-lingual Information Retrieval using only Wikipedia”, In: Proceedings of CLEF, pp. 58–65, 2009.

67. P. Pingali, and V. Varma “Multilingual Indexing Support for CLIR using Language Modeling.” in Bulletin of the IEEE Computer Society Technical Committee on Data Engineering, India, 2008.

68. P. Pingali, K. Kula, and V. Varma, “Hindi, Telugu, Oromo, English CLIR Evaluation”, in Evaluation of Multilingual and Multi-modal Information Retrieval, 7th Workshop of the Cross-Language Evaluation Forum, Alicante, Spain, Vol., 4730, 2007.

69. P.Sengottuvelan, A.Karthikeyan, “An Novel Approach Using Semantic Information Retrieval For Tamil Documents”, International Journal of Engineering Science and Technology, Vol. 2 No.9, 2010.

70. Petrelli. D., Levin. S., Beaulieu. M., and Sanderson. M “Which user interaction for cross-language information retrieval? Design issues and reflections”, In Journal of the American Society for Information Science and Technology, 57 (5), 709-722.