Welcome to Yaswanth’s Homepage!

Illustration I am a graduate researcher who is working on Language Models and I did my undergrad and masters from the Center for Excellence in Artificial Intelligence , Indian Institute of Technology Kharagpur. I was also an undergraduate researcher at the Complex Networks Research Group, IIT Kharagpur under the supervision of Prof. Pawan Goyal for the past 3 years. If you have any ideas that you would like to collaborate on, hit me up!

Publications 📚

Research Internships 🧪

Microsoft Logo      UC Berkeley Logo      UIUC Logo      UT Dallas Logo      IIT Kharagpur Logo

News 🏆📚

  • [Apr 2024] Waterloo Research: Started research internship at University of Waterloo’s TIGER Lab. 🎓
  • [Dec 2023] Grant Increase: Microsoft Research doubled our grant to $40,000! 🎉
  • [Oct 2023] Azure A100 Deployment: Launched A100 (80GB GPU) Server on Azure with 220GB RAM. 🚀
  • [Oct 2023] EMNLP 2023: First author paper accepted at EMNLP 2023 (Findings). 🎉
  • [Sep 2023] EMNLP 2023: Served as a Student Paper Reviewer. 🔍
  • [Apr 2023] Best Thesis Awards: Received highest grades for both Bachelor’s and Master’s theses. 🥇
  • [Apr 2023] Best Term Papers: Awarded for “Natural Language Processing”, “Graph Machine Learning”, “AI Design Lab”. 📝

Research Experience

🤖 FinAgent: Financial Trading Agent [Apr 2024 - Present]

  • University of Waterloo: Developing a multimodal agent for enhanced financial trading and stock portfolio optimization. Created comprehensive dataset of financial news and stock prices for 100 companies.

📊 Automatic Evaluation Framework using LLMs [Aug 2023 - Apr 2024]

  • Microsoft Research India: Developed a 3-stage evaluation framework with LLM agents (GPT-3.5, Mixtral-8*7b, Gemini-pro) for NLG evaluation. Achieved SOTA correlation with human evaluation.

📑 Event Extraction using Large Language Models [October 2023 - Present]

  • University of Texas, Dallas: Leading a research project focusing on document-level and sentence-level event extraction with GPT-3.5, GPT-4 and various open source Large Language Models.

📚 Citation Integrity [June 2023 - September 2023]

  • University of Illinois at Urbana-Champaign: Developed a new dataset, examined the integrity of citations, and classified them into categories, such as “irrelevant citation,” while also extracting evidence from the cited papers.

🌐 Machine Translation using Large Language Models [May 2023 - Sept 2023]

  • University of California, Berkeley: Enhanced 500K lines of machine translation data with synthetic generation from LLMs, employing Google-OCR, BERT-based aligners, and Sandhi splitting techniques. Notably increased BLEU scores by 10% using the “No Language Left Behind” (NLLB) model and advanced post-processing methods.

🚦 Compressing Yolo Object Detection using NN-LUT [August 2023 - September 2023]

  • Indian Institute of Technology, Kharagpur: Developed a pedestrian detection system with YOLOv4 architecture, trained on the EuroCity Persons dataset and compressed the model using Neural Network based Look up table for self driving cars. Also deployed a GRU model with autoencoder for intrusion detection using CAN dataset.

🧩 Nested Compound Parsing and Type Identification [December 2022 - April 2023]

  • Indian Institute of Technology, Kharagpur: Introduced a novel task, dataset and framework focused on identifying correct parsing and semantic relations between components of compounds in Sanskrit.

💡 Commonsense Injection to Multimodal Reasoning Models [February 2023 - April 2023]

  • Indian Institute of Technology, Kharagpur: Proposed a method to mitigate common sense mistakes made by the Multimodal Chain-of-Thought reasoning in language models by incorporating commonsense knowledge via knowledge graphs.

❓ Asking Clarifying Questions for Dialogue Systems [February 2023 - April 2023]

  • Indian Institute of Technology, Kharagpur: Focused on Asking Clarifying Questions in an open-domain language system, encompassing two subtasks: determining when to ask a clarifying question and which question to ask.

🏷️ Semantic Tag Recommendation Framework for Quotes [August 2022 - November 2022]

  • Indian Institute of Technology, Kharagpur: Predicted various relevant tags from a list of 39,000 available categories for a given Quote and used the context of each Quote as well.

🧠 Research Interests

  • 🤖 Large Language Models
    • 🖼️ Multimodality
    • 🧐 Reasoning
    • 💭 Hallucinations
    • 🎯 Reasoning & Planning
    • 🎮 Agentic Behavior
    • ✅ Evaluation

✍️ Blog Posts

🎉 Hobbies & Interests

  • ♟️ Chess Enthusiast: I’m an avid chess player and have won multiple district-level tournaments. Fancy a game? Find me on chess.com and let’s play!
  • 🏀 Basketball & 🏐 Volleyball: Whether it’s shooting hoops or spiking the ball, I love staying active and playing sports in my free time.
  • 👨‍🍳 Cooking & 🌎 Travelling: Exploring new cuisines and cultures is my passion. I find joy in cooking up a storm in the kitchen and embarking on culinary adventures.
  • 🧘‍♂️ Meditation: In the midst of life’s hustle and bustle, I find peace and rejuvenation through meditation. It’s my way of staying grounded and mindful.
  • 🧢 High School Captain: I took on the mantle of leadership in high school, guiding and inspiring my peers as the school captain.