Rahul Sharma


2023

pdf bib
Ranking LLM-Generated Loop Invariants for Program Verification
Saikat Chakraborty | Shuvendu Lahiri | Sarah Fakhoury | Akash Lal | Madanlal Musuvathi | Aseem Rastogi | Aditya Senthilnathan | Rahul Sharma | Nikhil Swamy
Findings of the Association for Computational Linguistics: EMNLP 2023

Synthesizing inductive loop invariants is fundamental to automating program verification. In this work we observe that Large Language Models (such as gpt-3.5 or gpt-4) are capable of synthesizing loop invariants for a class of programs in a 0-shot setting, yet require several samples to generate the correct invariants. This can lead to a large number a calls to a program verifier to establish an invariant. To address this issue, we propose a re-ranking approach for the generated results of LLMs. We have designed a ranker that can distinguish between correct inductive invariants and incorrect attempts based on the problem definition. The ranker is optimized as a contrastive ranker. Experimental results demonstrate that this re-ranking mechanism significantly improves the ranking of correct invariants among the generated candidates, leading to a notable reduction in the number of calls to a verifier.

2022

pdf bib
Federated Learning with Noisy User Feedback
Rahul Sharma | Anil Ramakrishna | Ansel MacLaughlin | Anna Rumshisky | Jimit Majmudar | Clement Chung | Salman Avestimehr | Rahul Gupta
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies

Machine Learning (ML) systems are getting increasingly popular, and drive more and more applications and services in our daily life. Thishas led to growing concerns over user privacy, since human interaction data typically needs to be transmitted to the cloud in order to trainand improve such systems. Federated learning (FL) has recently emerged as a method for training ML models on edge devices using sensitive user data and is seen as a way to mitigate concerns over data privacy. However, since ML models are most commonly trained with label supervision, we need a way to extract labels on edge to make FL viable. In this work, we propose a strategy for training FL models using positive and negative user feedback. We also design a novel framework to study different noise patterns in user feedback, and explore how well standard noise-robust objectives can help mitigate this noise when training models in a federated setting. We evaluate our proposed training setup through detailed experiments on two text classification datasets and analyze the effects of varying levels of user reliability and feedback noise on model performance. We show that our method improves substantially over a self-training baseline, achieving performance closer to models trained with full supervision.

2014

pdf bib
A hybrid approach for automatic clause boundary identification in Hindi
Rahul Sharma | Soma Paul
Proceedings of the Fifth Workshop on South and Southeast Asian Natural Language Processing

pdf bib
A rule based approach for automatic clause boundary detection and classification in Hindi
Rahul Sharma
Proceedings of the Fifth Workshop on South and Southeast Asian Natural Language Processing

pdf bib
Exploring the effects of Sentence Simplification on Hindi to English Machine Translation System
Kshitij Mishra | Ankush Soni | Rahul Sharma | Dipti Sharma
Proceedings of the Workshop on Automatic Text Simplification - Methods and Applications in the Multilingual Society (ATS-MA 2014)

2013

pdf bib
Automatic Clause Boundary Annotation in the Hindi Treebank
Rahul Sharma | Soma Paul | Riyaz Ahmad Bhat | Sambhav Jain
Proceedings of the 27th Pacific Asia Conference on Language, Information, and Computation (PACLIC 27)