Dr. Narayan Kumar Choudhary


Dr. Narayan Kumar Choudhary Photo

Lecturer cum Junior Research Officer,
Central Institute of Indian Languages,
Department of Higher Education, Ministry of Education, Government of India
Manasagangotri, Mysuru - 570006, Karnataka, INDIA
Email: nchoudhary.ciil AT gmail {dot} com; n.choudhary AT gov {dot} in
Phone (O): 0821-234-5007 / 0821-234-5092
Personal Weblog: A Linguist's Take

Research Interests | Research Scholars | Publications | Invited Talks | Awards | Past Research & Work Experiences | Educational Qualifications | Memberships | Other Info

Research Interests


Computational Linguistics, Natural Language Processing (NLP) and Language Technology in general.
Corpus Linguistics, Data Science, Linguistic Annotation, Syntactic/Dependency Parsing, Multilingual Lexicography, and related areas.

Other Interests

Language Typology, Language Documentation, Language Policy and Planning, General Linguistics.

Current Institutional Responsibilities

Assistant Director (Admin) / Head of Administration & DDO (Since August 1, 2016 till 30 September, 2024)
Officer in-Charge, Linguistic Data Consortium for Indian Languages (LDCIL), (Since May 2017 till date)
Officer in-Charge, Computer Application Unit, (Since March 2016 till date)
Estate Officer (August 2016 - May, 2017; September, 2021 - till 30 September, 2024)

PhD Research Scholars


Awarded

  • Shivangi Priya, Research Title: Language Maintenance & Shift of the Maithili Language after its Inclusion in the Eighth Schedule: A Case Study Registration Year: 2015; Year of Submission: 2021; Year of Award: 2021
  • Ongoing

  • Shantanu Kumar, Research Title: Automatic Speech Recognition in Maithili: Issues and Challenges Registration Year: 2021;
  • Poulami Das, Research Title: Lexical Similarities in the Languages of India: A Perspective and a MethodRegistration Year: 2021;
  • MusabYassir Salem Almatarneh, Research Title: A Variation study of Arabic as spoken in Jordan Registration Year: 2021;
  • NOTE for Prospective PhD Scholars: If you wish to do PhD or take up a research under me, you would need to enroll at University of Mysore, India where CIIL is a recognized research Institution. Please note that CIIL does not directly provide any degree as it is not a recognized academic Institution by the University Grants Commission.
    If you wish to do any kind of research or development under me, please be very strong with your analytical skills and have a good grasp using technology in applying to your research ideas.Prospective Interns: You can take up internships with me, specially in the area of NLP, data science, AI and machine learning. To show an interest, write an email to me with the details of your ideas and research proposal.

    Publications


    Books

    1. Rejitha K. S. and Narayan Kumar Choudhary. (ed.). 2023. Compendium of LDC-IL Sentence Aligned Speech Corpus. Central Institute of Indian Languages, Mysore. ISBN: 978-81-19411-34-4.
    2. Syeda Mustafiza Tamim, Priyanshe Adhyapak, Rajesha N., Manasa G., Srikanth D., Stephen Fernandes, Nithin S., Narayan Kumar Choudhary, Shailendra Mohan. 2023. Assamese Sentence Aligned Speech CorpusCentral Institute of Indian Languages, Mysore. ISBN: 978-81-19411-53-5
    3. Sonali Sutradhar, Poulami Das, Rajesha N., Manasa G., Srikanth D.,Stephen Fernandes, Nithin S., Narayan Kumar Choudhary, Shailendra Mohan. 2023. Bengali Sentence Aligned Speech Corpus Central Institute of Indian Languages, Mysore. ISBN: 978-81-19411-48-1
    4. Satyaendra Kumar Awasthi, Ankita Tiwari, Rajesha N., Manasa G., Srikanth D., Stephen Fernandes, Nithin S., Narayan Kumar Choudhary, Shailendra Mohan. 2023.Hindi Sentence Aligned Speech CorpusCentral Institute of Indian Languages, Mysore. ISBN: 978-81-19411-28-3
    5. Vijayalaxmi F. Patil, Chetan Baji, Kavitha Lenin, Reshma S., Rajesha N., Manasa G., Srikanth D., Stephen Fernandes, Nithin S., Narayan Kumar Choudhary, Shailendra Mohan. 2023. Kannada Sentence Aligned Speech Corpus Central Institute of Indian Languages, Mysore. ISBN: 978-81-19411-19-1
    6. Saurabh Varik, Rajesha N., Manasa G., Srikanth D., Stephen Fernandes, Nithin S., Narayan Kumar Choudhary, Shailendra Mohan. 2023. Konkani Sentence Aligned Speech Corpus Central Institute of Indian Languages, Mysore. ISBN: 978-81-19411-62-7
    7. Shantanu Kumar, Dinesh Mishra, Rajesha N., Manasa G., Srikanth D., Stephen Fernandes, Nithin S., Narayan Kumar Choudhary, Shailendra Mohan. 2023. Maithili Sentence Aligned Speech Corpus Central Institute of Indian Languages, Mysore. ISBN: 978-81-19411-96-2
    8. Rejitha K. S., Sajila S., Saritha S.L., Rajesha N., Manasa G., Srikanth D., Stephen Fernandes, Nithin S., Narayan Kumar Choudhary, Shailendra Mohan. 2023. Malayalam Sentence Aligned Speech Corpus Central Institute of Indian Languages, Mysore. ISBN: 978-81-19411-58-0
    9. Bhageshree K Khandale, Rajesha N., Manasa G., Srikanth D., Stephen Fernandes, Nithin S., Narayan Kumar Choudhary, Shailendra Mohan. 2023. Marathi Sentence Aligned Speech Corpus Central Institute of Indian Languages, Mysore. ISBN: 978-81-19411-92-4
    10. Umesh Chamling Rai, Rupesh Rai, Rajesha N., Manasa G., Srikanth D., Stephen Fernandes, Nithin S., Narayan Kumar Choudhary, Shailendra Mohan. 2023. Nepali Sentence Aligned Speech Corpus Central Institute of Indian Languages, Mysore. ISBN: 978-81-19411-98-6
    11. Santosh Kumar Mohanty, Rajesha N., Manasa G., Srikanth D., Stephen Fernandes, Nithin S., Narayan Kumar Choudhary, Shailendra Mohan. 2023. Odia Sentence Aligned Speech Corpus Central Institute of Indian Languages, Mysore. ISBN: 978-81-19411-74-0.
    12. Amudha R., Kamaraj S., Rajesha N., Manasa G., Srikanth D., Stephen Fernandes, Nithin S., Narayan Kumar Choudhary, Shailendra Mohan. 2023. Tamil Sentence Aligned Speech Corpus. Central Institute of Indian Languages, Mysore. ISBN: 978-81-19411-26-9.
    13. Mansoor Khan, Shahnawaz Alam, Bi Bi Mariyam, Rajesha N., Manasa G., Srikanth D., Stephen Fernandes, Nithin S., Narayan Kumar Choudhary, Shailendra Mohan. 2023. Urdu Sentence Aligned Speech Corpus. Central Institute of Indian Languages, Mysore. ISBN: 978-81-19411-87-0.
    14. Rejitha K. S., Poulami Das, Rajesha N., Manasa G., Srikanth D., Nithin S., Narayan Kumar Choudhary, Shailendra Mohan. 2023. Indian English-Bengali variant Sentence Aligned Speech Corpus. Central Institute of Indian Languages, Mysore. ISBN: 978-81-19411-43-6.
    15. Rejitha K. S., Vijayalaxmi F. Patil, Rajesha N., Manasa G., Srikanth D., Nithin S., Narayan Kumar Choudhary, Shailendra Mohan. 2023. Indian English-Kannada variant Sentence Aligned Speech Corpus. Central Institute of Indian Languages, Mysore. ISBN: 978-81-19411-35-1.
    16. Rejitha K. S., Vijayalaxmi F. Patil, Rajesha N., Manasa G., Srikanth D., Nithin S., Narayan Kumar Choudhary, Shailendra Mohan. 2023. Indian English-Kannada variant Sentence Aligned Speech Corpus. Central Institute of Indian Languages, Mysore. ISBN: 978-81-19411-35-1.
    17. Satyaendra Kumar Awasthi, Ankita Tiwari, Narayan Kumar Choudhary. 2023. Chhattisgarhi Raw Speech Corpus. Central Institute of Indian Languages, Mysore. ISBN: 978-81-19411-78-8
    18. Ankita Tiwari, Satyaendra Kumar Awasthi, Narayan Kumar Choudhary 2023. A Gold Standard Chhattisgarhi Raw Text Corpus. Central Institute of Indian Languages, Mysore. ISBN: 978-81-19411-64-1
    19. Choudhary, Narayan (ed.). 2021. Compendium of Linguistic Resources in Indian Languages. Central Institute of Indian Languages, Mysore. ISBN: 978-81-948885-6-7.
    20. Ramamoorthy L., Narayan Kumar Choudhary, Atreyee Sharma, Jahnobi Kalita, Samhita Bharadwaj, Taznin Hussain, Priyanshe Adhyapak, Mustafiza Tamim, Rajesha N., Manasa. G. 2021 A Gold Standard Assamese Raw Text Corpus Central Institute of Indian Languages, Mysore. 978-81-948885-4-3.
    21. Ramamoorthy L., Narayan Kumar Choudhary, Atreyee Sharma, Ashmrita Gogoi, Jahnobi Kalita, Samhita Bharadwaj, Plabita Bora, Priyanshe Adhyapak, Mustafiza Tamim, Rajesha N., Manasa G.. 2021 Assamese Raw Speech Corpus Central Institute of Indian Languages, Mysore. 978-81-948885-5-0.
    22. Narayan Kumar Choudhary, Sunil Kumar Choudhary, Rajesha N., Manasa G.. 2021 Dogri Raw Speech Corpus Central Institute of Indian Languages, Mysore. 978-81-948885-7-4.
    23. Ramamoorthy L., Narayan Kumar Choudhary, Mona Parakh, Hiren Gadhavi R, Solanki Mahesh kumar R, Rejitha KS, Rajesha N., Manasa, G.. 2021 Gujarati Raw Speech Corpus Central Institute of Indian Languages, Mysore. 978-81-948885-0-5.
    24. Ramamoorthy L., Narayan Kumar Choudhary, Mona Parakh, Rejitha KS, Rajesha N., Manasa, G.. 2021 Gujarati Raw Speech Corpus (Mono Recordings) Central Institute of Indian Languages, Mysore. 978-81-948885-8-1.
    25. Ramamoorthy L., Narayan Kumar Choudhary, Arundhati Sengupta, Rejitha KS, Rajesha N., Manasa, G.. 2021 Indian English Raw Speech Corpus - Bengali Variant Central Institute of Indian Languages, Mysore. 978-81-948885-1-2.
    26. Ramamoorthy L., Narayan Kumar Choudhary, Bharatha Raju A., Rejitha KS, Rajesha N., Manasa, G.. 2021 Indian English Raw Speech Corpus - Kannada Variant Central Institute of Indian Languages, Mysore. 978-81-948885-9-8.
    27. Narayan Kumar Choudhary, Shahid Mushtaq Bhatt, Rajesha N., Manasa G.. 2021 Kashmiri Raw Speech Corpus Central Institute of Indian Languages, Mysore. 978-81-948885-2-9.
    28. Narayan Kumar Choudhary, Rajesha N., Manasa G.. 2021 Mulitilingual Raw Speech Corpus Central Institute of Indian Languages, Mysore. 978-81-948885-3-6.
    29. Ramamoorthy L., Narayan Choudhary, Raja Kumar Naik, Pramod Kumar Rout, Kshirod Kumar Das, Santosh Kumar Mohanty. 2021 Odia Raw Speech Corpus Central Institute of Indian Languages, Mysore. 978-93-91386-00-9.
    30. Ramamoorthy L., Narayan Choudhary, Thennarasu S, Prem Kumar L R, Amudha R, Prabagaran R. 2021 Tamil Raw Speech Corpus Central Institute of Indian Languages, Mysore. 978-93-91386-01-6.
    31. Ramamoorthy, L., Narayan Choudhary, Sonali Sutradhar, Arundhati Sengupta, Sankarshan Dutta, Priyanka Das & Saswati Karmakar. 2019. A Gold Standard Bengali Raw Text Corpus.Central Institute of Indian Languages, Mysore. 978-81-7343-204-0.
    32. Ramamoorthy, L., Narayan Choudhary, Bridul Basumatary & Farson Daimary. 2019. A Gold Standard Bodo Raw Text Corpus. Central Institute of Indian Languages, Mysore. ISBN: 978-81-7343-209-5.
    33. Ramamoorthy, L., Narayan Choudhary & Sunil Kumar. 2019. A Gold Standard Dogri Raw Text Corpus. Central Institute of Indian Languages, Mysore. ISBN: 978-81-7343-213-2.
    34. Ramamoorthy, L., Narayan Choudhary, Mona Parakh, Purva S Dholakia., Gadhavi R Hiren & Maheshkumar R Solanki. 2019. A Gold Standard Gujarati Raw Text Corpus. Central Institute of Indian Languages, Mysore. ISBN: 978-81-7343-214-9.
    35. Ramamoorthy, L., Narayan Choudhary, Jitendra Kumar Singh, Richa, Anjali Sinha, Dheeraj Kumar Mishra, Arimardan Kumar Tripathi, Aditi Debsharma, Satyaendra Kumar Awasthi & Madhupriya Pathak. 2019. A Gold Standard Hindi Raw Text Corpus. Central Institute of Indian Languages, Mysore. ISBN: 978-81-7343-219-4.
    36. Ramamoorthy, L., Narayan Choudhary, Vijayalaxmi F. Patil, Chetan Suryakant Baji, Malini N Abhyankar, Rajesha N. & Manasa G. 2019. A Gold Standard Kannada Raw Text Corpus. Central Institute of Indian Languages, Mysore. ISBN: 978-81-7343-226-2.
    37. Ramamoorthy, L., Narayan Choudhary & Shahid Mushtaq Bhat. 2019. A Gold Standard Kashmiri Raw Text Corpus. A Gold Standard Kashmiri Raw Text Corpus. Central Institute of Indian Languages, Mysore. ISBN: 978-81-7343-230-9.
    38. Ramamoorthy, L., Narayan Choudhary, Saurabh Varik, Rashmi Shet Tanawade & Yashwant D Gawas. 2019. A Gold Standard Konkani Raw Text Corpus. Central Institute of Indian Languages, Mysore. ISBN: 978-81-7343-232-3.
    39. Ramamoorthy, L., Narayan Choudhary, Arun Kumar Singh & Dinesh Mishra. 2019. A Gold Standard Maithili Raw Text Corpus. Central Institute of Indian Languages, Mysore. ISBN: 978-81-7343-236-1.
    40. Ramamoorthy, L., Narayan Choudhary, Saritha S.L., Rejitha K.S. & Sajila S. 2019. A Gold Standard Malayalam Raw Text Corpus. Central Institute of Indian Languages, Mysore. ISBN: 978-81-7343-240-8.
    41. Ramamoorthy, L., Narayan Choudhary, Amom Nandaraj Meetei, Yumnam Premila Chanu, Longjam Anand Singh & M. Bidyarani Devi. 2019. A Gold Standard Manipuri Raw Text Corpus. Central Institute of Indian Languages, Mysore. ISBN: 978-81-7343-245-3.
    42. Ramamoorthy, L., Narayan Choudhary, Gajanan R Apine & Apurva P Betkekar. 2019. A Gold Standard Marathi Raw Text Corpus. Central Institute of Indian Languages, Mysore. ISBN: 978-81-7343-250-7.
    43. Ramamoorthy, L., Narayan Choudhary, Samar Sinha, Jeena Rai, Umesh Chamling Rai & Rupesh Rai. 2019. A Gold Standard Nepali Raw Text Corpus. Central Institute of Indian Languages, Mysore. ISBN: 978-81-7343-255-2.
    44. Ramamoorthy, L., Narayan Choudhary, Raja Kumar Naik, Pramod Kumar Rout, Kshirod Kumar Das & Santosh Kumar Mohanty. 2019. A Gold Standard Odia Raw Text Corpus. Central Institute of Indian Languages, Mysore. ISBN: 978-81-7343-258-3.
    45. Ramamoorthy, L., Narayan Choudhary, G. Palanirajan, S. Thennarasu, Prem Kumar L. R, Amudha R., Prabagaran R., Vijayan N. & M. Ramesh Kumar. 2019. A Gold Standard Tamil Raw Text Corpus. Central Institute of Indian Languages, Mysore. ISBN: 978-81-7343-266-8.
    46. Ramamoorthy, L., Narayan Choudhary, Thirupal C Reddy & Gangaraju H. 2019. A Gold Standard Telugu Raw Text Corpus. Central Institute of Indian Languages, Mysore. ISBN: 978-81-7343-271-2.
    47. Ramamoorthy, L., Narayan Choudhary, Mansoor Khan, Shahnawaz Alam, Bi Bi Mariyam & Rushda Idris Khan. 2019. A Gold Standard Urdu Raw Text Corpus. Central Institute of Indian Languages, Mysore. ISBN: 978-81-7343-274-3.
    48. Ramamoorthy, L., Narayan Choudhary, Sonali Sutradhar, Priyanka Biswas, Arundhati Sengupta, Sankarshan Dutta & Priyanka Das. 2019. Bengali Raw Speech Corpus. Central Institute of Indian Languages, Mysore. ISBN: 978-81-7343-206-4.
    49. Ramamoorthy, L., Narayan Choudhary, Bridul Basumatary & Farson Daimary. 2019. Bodo Raw Speech Corpus. Central Institute of Indian Languages, Mysore. ISBN: 978-81-7343-211-8.
    50. Ramamoorthy, L., Narayan Choudhary, Jitendra Kumar Singh, Richa, Anjali Sinha, Dheeraj Kumar Mishra, Arimardan Kumar Tripathi & Satyaendra Kumar Awasthi. 2019. Hindi Raw Speech Corpus. Central Institute of Indian Languages, Mysore. ISBN: 978-81-7343-221-7.
    51. Ramamoorthy, L., Narayan Choudhary, Vijayalaxmi F. Patil, Chetan Suryakant Baji, Malini N. Abhyankar, Rajesha N. & Manasa G. 2019. Kannada Raw Speech Corpus. Central Institute of Indian Languages, Mysore. ISBN: 978-81-7343-228-6
    52. Ramamoorthy, L., Narayan Choudhary, Saurabh Varik & Rashmi Shet Tanawade. 2019. Konkani Raw Speech Corpus. Central Institute of Indian Languages, Mysore. ISBN: 978-81-7343-234-7.
    53. Ramamoorthy, L., Narayan Choudhary, Arun Kumar Singh, Dinesh Mishra & Atuleshwar Jha. 2019. Maithili Raw Speech Corpus. Central Institute of Indian Languages, Mysore. ISBN: 978-81-7343-238-5.
    54. Ramamoorthy, L., Narayan Choudhary, Gajanan R Apine & Apurva P Betkekar. 2019. Marathi Raw Speech Corpus. Central Institute of Indian Languages, Mysore. ISBN: 978-81-7343-251-4.
    55. Ramamoorthy, L., Narayan Choudhary, Samar Sinha, Jeena Rai, Umesh Chamling Rai & Rupesh Rai. 2019. Nepali Raw Speech Corpus. Central Institute of Indian Languages, Mysore. ISBN: 978-81-7343-255-2.
    56. Ramamoorthy, L., Narayan Choudhary, Poonam Dhillon & Sarbjeet Kaur. 2019. Punjabi Raw Speech Corpus. Central Institute of Indian Languages, Mysore. ISBN: 978-81-7343-264-4.
    57. Ramamoorthy, L., Narayan Choudhary & Rajesha N. 2019. Telugu Raw Speech Corpus. Central Institute of Indian Languages, Mysore. ISBN: 978-81-7343-272-9.
    58. Ramamoorthy, L., Narayan Choudhary, Mansoor Khan, Shahnawaz Alam, Bi Bi Mariyam & Rushda Idris Khan. 2019. Urdu Raw Speech Corpus. Central Institute of Indian Languages, Mysore. ISBN: 978-81-7343-276-7.
    59. Ramamoorthy, L., Narayan Choudhary, Poonam Dhillon, Sarbjeet Kaur & Sandeep Singh. 2019. A Gold Standard Punjabi Raw Text Corpus. Central Institute of Indian Languages, Mysore. ISBN: 978-81-7343-262-0.
    60. Choudhary, Narayan (ed.). 2019. Linguistic Resources for AI/NLP in Indian Languages. Central Institute of Indian Languages, Mysore. ISBN: 978-81-7343-295-8.
    61. Choudhary, Narayan. 2018. Cost Analysis of Linguistic Resources. Central Institute of Indian Languages, Mysore. ISBN: 978-81-7343-283-5.
    62. Narayan Choudhary, Gibu Sabu M. (eds.). 2011. Proceedings of the Third Students’ Conference of Linguistics in India (SCONLI-3). Parimal Publishers, New Delhi.
    63. Kalika Bali, Monojit Choudhury, Priyanka Biswas, Girish Nath Jha, Narayan Choudhary, Maansi Sharma. 2010. Indian Language Part-of-Speech Tagset: Hindi.. Linguistic Data Consortium, Philadelphia.

    Research Papers in Refereed Journals/Conferences

    1. Rejitha K. S.; Rajesha N.; Narayan Kumar Choudhary. 2023. Type-Token Analysis on LDC-IL Text Corpus. In Proceedings of 26th Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques (O-COCOSDA), Delhi, India, 2023. pp. 1-6. doi: 10.1109/O-COCOSDA60357.2023.10482978.
    2. Rajesha N.; Rejitha K. S.; Narayan Kumar Choudhary. 2023. Evaluation of Assamese Speech Data Transcriptions by Levenshtein Distance. In 26th Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques (O-COCOSDA), Delhi, India, 2023, pp. 1-4, doi: 10.1109/O-COCOSDA60357.2023.10482939.
    3. Narayan Choudhary, Shantanu Kumar. 2023. भारतीय भाषाओं में वर्तनियों के मानकीकरण की स्थिति (Status of Spelling Standardization in Indian Languages). भाषा (Bhasha, ISSN: 05231418). वैज्ञानिक तथा तकनीकी शब्दावली आयोग (Commission for Scientific and Technical Terminology, CSTT), pp. New Delhi.
    4. Choudhary, Narayan; Premkumar, LR; Singh, Chandan; Mondal, Shubhanan; Shivangi, Priya; Sudarshan, Beluru, Perumal Samy, Mohan, Shailendra. 2023. Bharatavani Project - Reviving Linguistic Diversity and Cultural Heritage in India: A Case Study, In Proceedings of the 2nd International Workshop on Digital Language Archives: LangArc 2023. University of North Texas Libraries, UNT Digital Library, https://digital.library.unt.edu; crediting UNT College of Information. pp. 5-7. doi: https://doi.org/10.12794/langarc2114300
    5. Shivangi Priya, Pushp Lata, Narayan Choudhary. 2021. Patterns of Language Use and Language Preferences of Maithili Speakers in Bihar. In Mithila Bharati, issue No. VIII, 2021. pp. 84-97.
    6. N. Choudhary and D. G. Rao. 2020. The LDC-IL Speech Corpora. In Proceedings of 23rd Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques (O-COCOSDA), Yangon, Myanmar, 2020. pp. 28-32, doi: https://doi.org/10.1109/O-COCOSDA50338.2020.9295011
    7. Choudhary, N. 2021. LDC-IL: The Indian Repository of Resources for Language Technology. Language Resources & Evaluation. Springer, Vol. 55, Issue 1. doi: https://doi.org/10.1007/s10579-020-09523-3
    8. Parth Pathak, Pinal Patel, Vishal Panchal, Sagar Soni, Kinjal Dani, Narayan Choudhary, Amrish Patel. 2015. ezDI: A Supervised NLP System for Clinical Narrative Analysis. In: Proceedings of the 9th International Workshop on Semantic Evaluation (SemEval 2015). Denver, Colorado. (Accorded first rank in the shared task)
    9. Neha Dixit and Narayan Choudhary. 2014. Evaluating Two Annotated Corpora of Hindi Using a Verb Class Identifier. In Proceedings of ICON 2014. Goa University, Goa (To appear in ACL Anthology).
    10. Neha Dixit and Narayan Choudhary. 2014. Automatic Classification of Hindi Verbs in Syntactic Perspective. International Journal of Emerging Technology and Advanced Engineering, Volume 4, 8th Issue. ( ISSN 2250 – 2459 (Online))
    11. Parth Pathak, Pinal Patel, Vishal Panchal, Narayan Choudhary, Amrish Patel, Gautam Joshi. 2014. ezDI: A Hybrid CRF and SVM based Model for Detecting and Encoding Disorder Mentions in Clinical Notes. In: Proceedings of the 8th International Workshop on Semantic Evaluation (SemEval 2014 Shared Task, awarded third best result). Dublin, Ireland. ISBN 978-1-941643-29-7
    12. Narayan Choudhary, Parth Pathak, Pinal Patel, Vishal Panchal. 2014. Annotating a Large Representative Corpus of Clinical Notes for Parts of Speech. 2014. In: Proceedings of 8th Linguistic Annotation Workshop, Dublin, Ireland. ISBN 978-1-941643-29-7
    13. Narayan Choudhary, Girish Nath Jha. 2011.Creating Multilingual Parallel Corpora in Indian Languages. 2011. In Proceedings of the 5th Language & Technology Conference, Poznan, Poland. (awarded the Best Student Paper)
    14. Narayan Choudhary, Girish Nath Jha and Pramod Pandey. 20011. A Rule based Method for the Identification of TAM features in a PoS Tagged Corpus. In Proceedings of the 5th Language & Technology Conference, Poznan, Poland.
    15. Narayan Choudhary. 2011. Web-drawn corpus for Indian Languages: A Case of Hindi. In Proceedings of Information Systems for Indian Languages. Volume 139, Part 2, 218-223. Springer Verlag.
    16. Narayan Choudhary. 2008. बोधात्मक भाषाविज्ञान, in Gaveshanaa, April-June, 2008 vol.:90/2008 Central Institute of Hindi, Agra. pp.:11-18 (This is a Hindi translation of the article “Cognitive Linguistics” from Encyclopedia of Linguistics by Gilles Falkner, 2006)
    17. Narayan Choudhary. 2007. Syllable Structure of Great Andamanese, November, 2006. In proceedings of National Seminar on Perspectives in Linguistics, Kashmir University, Srinagar, Kashmir. India. Pp. 141-146
    18. Narayan Choudhary, Anvita Abbi, Girish Nath Jha. 2007. Morphological Analyzer for Great Andamanese Verbs: Implementing a Concatenative Template. In Vishwabharat ( April 2007 - January 2008 Journal) TDIL, New Delhi, pp.113-118 http://tdil.mit.gov.in/april-jan-2008/8.8_Morphological_analyzer.pdf

    Invited Talks a few selected ones


    1. “NLP and Information Extraction”, SCONLI-07, Aligarh Muslim University, Aligarh, 8-10 February, 2013
    2. Orientation Course in Computational Linguistics, Tezpur University, Assam. 21-23 December, 2014
    3. Translation Process State, Knowledge Commission of Gujarat, Ahmedabad. 01-02-2016
    4. Current Trends in Translation Industry, National Translation Mission, CIIL, Mysore. 03-03-2017
    5. Translation Practices in the IT Industry, National Translation Mission, CIIL, Mysore. 04-03-2017
    6. Introduction to Translation, National Translation Mission, CIIL, Mysore. 16-05-2017
    7. ICT based teaching of English Training Programme for Secondary Teachers (AP, Telangana and TN), 3-09-2017 RIE, NCERT, Mysore
    8. Ways of Empowering Language using Language Technlogy, Language Across Curriculum in Multi-Linguistic Context: Scope and Challenges, 29-31 January, 2018 RIE, Mysore

    Awards


    1. Ranked 1st in “SemEval-2015 Task 14: Analysis of Clinical Text”, to occur in NAACL-2015, Denver, Colorado.
    2. Ranked 3rd in “SEMEVAL 2014: Shared Task 7: Analysis of Clinical Text”, Dublin, Ireland
    3. Best Student Paper Award for the paper titled “Creating Multilingual Parallel Corpora in Indian Languages” at LTC’11, Poznan, Poland
    4. UGC-NET Lectureship Award, 2003 and 2004 br

    Past Research Work Experiences


    1. NLP Research Engineer, ezDI, LLC.: July, 2012 – February, 2016
    2. Senior Linguist, Shallow Parser Tools for Indian Languages, JNU, New Delhi: May, 2012 – June, 2012
    3. Senior Linguist, Indian Languages Corpora Initiative (ILCI), JNU, New Delhi: March, 2009 – May, 2010
    4. Teaching Assistant, Centre for Linguistics and Special Centre for Sanskrit Studies, JNU, New Delhi: August, 2007 - July, 2009
    5. Research Assistant, Centre for Linguistics, JNU, New Delhi: August, 2005-July, 2007
    6. Project Associate, CSE, IIT Kanpur: May, 2005-June, 2005

    Educational Qualifications


    1. Ph. D. Thesis Title: Automatic Identification and Analysis of Verb Groups in Hindi. JNU. 2006-2011.
    2. M. Phil. Dissertation Title: Developing a Computational Framework for the Verb Morphology of Great Andamanese. JNU, New Delhi, 2006
    3. National Eligibility Test for Lectureship (NET -December, 2003; June, 2004) of the UGC
    4. Masters (Linguistics), JNU, New Delhi, 2004. MA Thesis: Word Order in Pnar (Jaintia)
    5. Bachelor of Arts (English Hons., Economics, History, Hindi), LNMU, Darbhanga, 2001.

    Memberships


    Linguistic Society of India, Life Member
    Association of Computing Machinery, Student Member, 2010-2013

    OtherInfo


    Computing Skills

    Platforms: Well versed with Windows and Linux (Ubuntu/RHEL)
    Development Environment: MySQL 5; PHP, JAVA, Python, C++, Perl, Prolog, LISP, CSS, HTML

    Languages

    Well Versed with Expertise: English, Hindi, Maithili
    Academic Knowledge: Sanskrit, Pnar (Jaintia), Great Andamanese, Gujarati, Kannada and many more

    Hobbies

    Reading, Writing, Music, Yoga, Swimming, Mountaineering.