Ashmal Vayani

I am an MSc. student in the College of Engineering and Computer Science department at the University of Central Florida. I am a member of the Center for Research in Computer Vision (CRCV) Lab advised by Prof. Mubarak Shah.

Previously, I was a Research Engineer in the Computer Vision Department, affiliated with the IVAL-Lab at Mohamed bin Zayed University of Artificial Intelligence (MBZUAI). I was advised by Prof. Fahad Khan, and Dr. Salman Khan (Aug 2023 - Jul 2024). During my undergrad, I worked as a Research Intern at Retrocausal with Dr. Zeeshan Zia. I completed my Bachelors from National University of Computer and Emerging Sciences majoring in Computer Science (Aug 2019 - June 2023).

Email  /  CV  /  Google Scholar  /  Github  /  LinkedIn

profile photo

Research Interests

I mostly work on Large Language Models, Vision Language Models, their efficiency, and building downstream industrial applications using RAG methods and LLM deployment. I have also curated high-quality datasets and benchmarks for Multilingual LMMs, Bias Mitigation, and Industrial Applications for the MENA region.

News

  • [Oct 2024] - One Paper Accepted at NeurIPS Vision Language Models Workshop, 2024.
  • [Aug 2024] - Joined the UCF (CRCV) as a Master's in Computer Vision Student.
  • [Jan 2024] - Promoted to a Research Engineer at MBZUAI.
  • [Dec 2023] - Merit Award - Tertiary Student Project APICTA Awards held in Hong Kong 2023.
  • [Dec 2023] - People's Choice Award at APICTA Awards held in Hong Kong 2023.
  • [Nov 2023] - Released the Jais Climate as a Lead Engineer.
  • [Aug 2023] - Joined the MBZUAI as a Research Assistant.
  • Publications

    * denotes joint first authors

    VURF-diag All Languages Matter: Evaluating LMMs on Culturally Diverse 100 Languages
    Ashmal Vayani, Dinura Dissanayake, Hasindri Watawana, Omkar Thawakar, Michael Felsberg, Thamar Solorio, Monojit Choudhury, Ivan Laptev, Mubarak Shah, Salman Khan, Fahad Shahbaz Khan
    Under Review
    Paper / Code
    SB-Bench-Figure SB-Bench: Sstereotype Bias Benchmark for Large Multimodal Models
    Vishal Narnaware*, Ashmal Vayani*, Rohit Gupta, Swetha Sirnam, Mubarak Shah
    Under Review
    Paper

    VURF-diag GAEA: World-Wide Geo-localization Assistant
    Ron Compas, Ashmal Vayani, Parth Kulkarni, Aritra Dutta, Mubarak Shah
    Under Review
    Paper

    VURF-diag VURF: A General-purpose Reasoning and Self-refinement Framework for Video Understanding
    Ashmal Vayani*, Ahmad Mahmood*, Muzammal Naseer, Salman Khan, Fahad Shahbaz Khan
    NeurIPS Vision Language Models Workshop 2024.
    Paper

    Mobillama-diag MobiLlama: Towards Accurate and Lightweight Fully Transparent GPT
    Omkar Thawakar*, Ashmal Vayani*, Salman Khan, Hisham Cholakkal, Rao Muhammad Anwer, Michael Felsberg, Timothy Baldwin, Eric P. Xing, Fahad Shahbaz Khan
    Under review
    Paper / Code