udit01/README.md

Udit Jain

Agentic AI Engineer | LLM Post-Training | Samsung Research AI Core Team

Portfolio LinkedIn Email


About

AI engineer at Samsung Research's AI Core Team in Seoul, building agentic AI systems through LLM post-training, reinforcement learning, and knowledge distillation. B.Tech in Computer Science from IIT Delhi, CGPA 9.10/10.0.

Previously shipped on-device computer vision models to millions of Samsung smart appliance users and ran health data analysis across Samsung Health's 200M+ user dataset.

What I Work On

LLM Post-Training & Alignment — End-to-end SFT, RLHF, and DPO pipelines for instruction following and safety alignment of large language models.

Reinforcement Learning for LLMs — Reward modeling, policy optimization (PPO variants), process-level and outcome-level reward signals for improving reasoning, planning, and agentic task completion.

Knowledge Distillation — Transferring frontier-model capabilities to deployable student models while maintaining benchmark performance at lower compute and latency.

Agent Evaluation — Multi-turn evaluation pipelines and benchmark suites for tool use, long-context reasoning, and autonomous task completion at scale.

Training Infrastructure — Distributed training with NeMo on multi-node GPU clusters. Large-scale inference and evaluation with vLLM.

Technical Stack

| Area | Technologies |
| --- | --- |
| LLM Training | NeMo, vLLM, PyTorch, Hugging Face Transformers, PEFT/LoRA |
| Methods | SFT, RLHF, DPO, PPO, reward modeling, knowledge distillation |
| Computer Vision | OpenCV, TensorFlow, object detection, segmentation, NPU optimization |
| Data & Cloud | AWS (S3, EMR, Glue, Athena), Spark, BigQuery, Apache Superset |
| Languages | Python, C/C++, Kotlin, Java |
| Systems | Distributed GPU clusters, Docker, Linux kernel, FPGA |

Career at Samsung Research (2020 - Present)

Agentic AI Engineer, AI Core Team (2026 - Present)
Post-training pipelines, RL for LLMs, distillation, agent evaluation frameworks.

Computer Vision Engineer, Digital Appliances R&D (2023 - 2025)
On-device food recognition for smart refrigerators: <40 MB model, <70 ms inference, >90% accuracy on NPU. Showcased at CES 2024. Product Video

Software Engineer, Data Service Lab (2022 - 2023)
Built the Samsung Health Research Application, connecting devices to clinical research. Won UX Design Award 2023. Press Release

Data Science Engineer, Data Intelligence Lab (2020 - 2022)
Health data analysis on 200M+ users (300 TB on AWS). Global sleep study featured on Korean national TV and Samsung Newsroom.

Research

  • Harvard University, Visual Computing Group (2020): Differentiable rendering, GPU-accelerated Monte Carlo simulation
  • Samsung Research, Data Analytics Lab (2019): Multilingual NLP for semantic extraction from B2C communications
  • Kyushu Institute of Technology (Kyutech), Japan (2018): Elderly care robotics with Mask R-CNN and a Baxter robot. Featured on Japanese national TV.

Publications:

Selected Achievements

KVPY Fellowship: All India Rank 3 / 500,000 students (IISc Bangalore, 2015)
JEE Advanced: All India Rank 81 / 200,000 qualified (2016)
IIT Delhi: B.Tech CS, CGPA 9.10/10.0, Institute Merit Award (top 7%)
Samsung: Best Paper Award (2022), AI & Cloud certifications, Data Science L2
Physics Olympiad: National Top 35, Gold Medal (HBCSE-Bombay)

What's Next

Building toward the intersection of agentic AI systems and real-world deployment at scale. Interested in the full stack of post-training — from reward signal design to evaluation infrastructure to production serving. Long-term: building something of my own.



Pinned

  1. 3D-Visualization-and-Conversion-Tool

    HTML 3 2

  2. devclub-iitd/SenData

    Simple tool for sending files in an intranet environment

    TypeScript 17 8

  3. analyzer_bot

    JavaScript 3

  4. Disk-Simulator-for-Data-Recovery

    Java 3 1

  5. niki-amini-naieni/CountGD

    Includes the code for training and testing the CountGD model from the paper CountGD: Multi-Modal Open-World Counting.

    Python 316 37

  6. nntrainer/nntrainer

    NNtrainer is a software framework for training and running inference on neural network models on devices.

    C++ 207 119