cv

Research interests

I am interested in efficient Deep learning, particularly making Large Language models and Vision models more efficient through quantization, pruning, and PEFT. I am also interested and have experience in modern Deep Learning paradigms such as Multimodal models, LLMs, and, INRs.

Work

  • 2024.04 - 2024.10
    Google DeepMind
    Student Researcher, Perception: Humans & Interactions team
    • Training ultra-low precision Mixture-of-Experts model for approximating general floating point models.
    • Designing efficient quantization schemes for multi-modal generative models with theoretical guarantees.
  • 2019.03 - 2021.08
    Texas Instruments
    Systems Engineer, High Speed Signal Conditioning team
    • Designed an Adaptive Equalizer in RX of Retimer SoCs to reduce Inter Symbol Interference at data rates of 56Gbps.
    • Developed a MMSE adaptation algorithm for fast convergence and a steady-state BER of <1e-15.
  • 2018.07 - 2019.02
    Texas Instruments
    Systems Engineer, Wireless Infrastructure Team
    • Developed fixed point models for DDC and DUC chains of 5G Transceiver SoCs and analyzed their SNR and SFDR.
    • Identified key contributors of spurs in the DDC chain and devised solutions to maintain SFDR below -100dBc.

Education

  • 2024.01 - 2026.05

    Madison, WI

    PhD
    University of Wisconsin-Madison
    Electrical and Computer Engineering
  • 2021.09 - 2026.05

    Madison, WI

    MS
    University of Wisconsin-Madison
    Electrical and Computer Engineering
  • 2018.07 - 2018.05

    Chennai, India

    B.Tech
    Indian Institute of Technology, Madras
    Electrical Engineering

Publications

Research projects

  • 2022.09 - 2022.12
    Data Augmentation for Image Classification using Latent Diffusion Models
    • Generated synthetic images using Stable Diffusion for CIFAR-10 dataset and trained a ResNet18 model.
    • Fine-tuned the Stable Diffusion using Textual Inversion, Dream-Booth to improve the accuracy to ∼82%
  • 2022.01 - 2022.06
    Performance of Multi-Image Comparison Queries
    • Proved that the Cross Cluster Edge Density for queries with 3 to 8 images per query is better than the classical pair comparison queries under Conditional Block Model.
    • Implemented a Python program to generate mathematical proof and derive the expressions for cross cluster edge density given the number of images per query.
  • 2017.12 - 2018.05
    Efficiency Model for Buck Converter
    • Developed an Empirical model for the total power loss in a Buck converter with maximum error <3%.
    • Derived an analytical expression for the Optimum sizes of switches and the frequency of operation of the Buck Converter.

Course projects

  • 2021.09 - 2021.12
    Robust Image Classification
    • Processed noisy imaged with DCT, DWT and improved the classification accuracy by 15% on the CIFAR-10 dataset.
    • Implemented a Bayesian uncertainty-based two-layer defense to defend a classifier against Adversarial attacks such as FGSM, BIM, and CW L2.
  • 2017.02 - 2017.05
    Systematic Encoding and Decoding of BCH Codes
    • Designed a systematic encoder and decoder for a binary BCH code with error and erasure correcting capabilities.
    • Implemented the simplified Berlekamp-Massey algorithm for binary codes to determine the error location polynomial.

Skills

Deep Learning
Transformers
LLMs
Diffusion Models
Computer Vision
Quantization
PEFT
INR
Coding
Python
Pytorch
Jax
Assembly
C
C++
Signal Processing
Harmonic Analysis
Wavelets
MatLab
Mathematics
Abstract Algebra
Probability
Analysis

Awards

  • Recipient of GCP Credit Award in support of my research on Efficient Transformer Models.
  • Graduated with “Honours” in Electrical Engineering from IIT Madras, 2018 (Top 7%).
  • Secured an All India Rank of 2293(Top 1.5%) in engineering Joint Entrance Exam Advanced 2014.