Rohan Wadhawan

Life is like a puzzle in progress. In the end, we all wish that the pieces come together into a masterpiece...

I am interested in Human-inspired Artificial Intelligence research at the intersection of Computer Vision, Multimodal Learning, Generative AI, Affective Computing, and Computational Neuroscience. I want to develop Eco-friendly technology, Equally Accessible to All. Moreover, I find the world of cameras and the world captured by a camera exhilarating. I am currently working as a Software Engineer in the Visual Intelligence Team at Samsung R&D Institute in Bangalore.


Education

Netaji Subhas Institute of Technology, an affiliate of Delhi University

Bachelors of Engineering in Computer Engineering
August 2016 - July 2020

Admission: Secured 2520 rank in all India Joint Engineering Entrace (JEE) Main exam, 2016.
Placed in Top 0.2% out of 1.2M candidates.

CGPA: 8.94/10.00 (1st division with Distinction, 89.4%)

Relevant Coursework: Mathematics (Linear Algebra, Multivariate Calculus), Discrete Structres (Logic, Counting Principles, Probability, Graph Theory), Algorithms, Artificial Intelligence, Neural Networks, Big Data and Analytics.

Bachelor's Thesis title: Face Synthesis using Descriptions Extracted from Unstructured Text


Work Experience

Senior Software Engineer

Visual Intelligence Team, Samsung R&D Institute, Bangalore
  • Developing new features and enhancing quality of the Expert RAW and Night Photography camera pipelines in the flagship models like S22.
February 2022 - Present

Software Engineer

Visual Intelligence Team, Samsung R&D Institute, Bangalore
  • Worked on commercialization, deployment, and performance optimization of various camera solutions like Video Stabilization, Hyperlapse, Portrait, and Single Take on existing and upcoming S series (S22), A series (A73), and M series (M53) smartphones and Tablets (Tab S8).
  • Ivestigated new camera modes and improvements in the image signal processing pipeline.
January 2021 - January 2022

Software Engineer - Artificial Intelligence Intern

Nable IT Consultancy Services, a computer vision startup
  • Improved the edge-based facial recognition system's accuracy and efficiency and made it agnostic to facial sizes.
  • Modularized facial recognition pipeline and Developed various standalone facial recognition applications on top of it.
January 2020 - March 2020

Software Engineer Intern

Visual Intelligence Team, Samsung R&D Institute, Bangalore
  • Developed a user-friendly, extensible, and customizable android application to automate the testing of the camera-flashlightmodule and reduce person-hours and cost of testing.
  • Software used: Android Studio, Java.
May 2019 - July 2019

Summer Trainee

HCL Infosystems Limited, Noida
  • Developed a personalized TODO list web application using JSP and servlets, Interfaced MySQL database with it and Deployed it on the WildFly application server.

    Go to Project Page

  • Software used: Java, MySQL, ERDPlus, WildFly.
June 2018 - July 2018

Research Experience

Research Affiliate

Neurocomputing Lab, Indian Institute of Technology, Delhi
January 2021 - Present

Research Assistant

Neurocomputing Lab, Indian Institute of Technology, Delhi
  • Architected a human-inspired, landmark-aware ensemble Facial Expression Recognition Network that improves the current benchmark on the CK+ and JAFFE datasets by 0.51% and 5.34%, respectively; demonstrates high cross-dataset generalization capacity on SFEW dataset; requires only 3.28 MFLOPs for inference.

    Go to Pre-Print

  • Invented a deep learning pipeline for water stress phenotyping of Chickpea plant; the devised spatio-temporal analysis achieves a ceiling level classification performance of 98.52% on JG-62 and 97.78% on Pusa-372 chickpea plant shoot image dataset; outperforms the best reported time-invariant technique by at least 14%; robust to noisy input, with a less than 2.5% dip in average model accuracy and a small standard deviation.

    Go to Publication

  • Designed and carried out neural network simulations for multiple lab projects on physiological signal processing.
  • Software used: Keras, TensorFlow, PyTorch, MATLAB, OpenCV, Google Colab, Overleaf.
July 2020 - January 2021

Undergraduate Researcher

Department of Computer Engineering, Netaji Subhas Insititute of Technology, an affiliate of Delhi University
August 2018 - July 2020

Projects

Face Synthesis using Descriptions Extracted from Unstructured Text

Undergraduate Head Researcher - Bachelor’s Thesis
  • Invented a novel pipeline to generate faces from their corresponding textual description. The motivation was to augment the reading experience for young children, especially those with reading difficulty, by animating characters through facial cues. Go to Publication
  • Developed a crowdsourcing platform and consolidated our Multi-Attributed and Structured Text-to-face (MAST) dataset consisting of structured textual descriptions for face images. Go to Project Page
  • Performed text classification to filter out descriptive sentences from a textual data consolidation of Gutenberg and Face2Text datasets using Bi-LSTM with attention mechanism and achieved 98.5% accuracy and 0.97 F1 score on the test set.
  • Devised an algorithm for fast transformation of an unstructured facial description to a structured one; it has linear complexity with respect to the number of words in the sentence.
  • Trained an Attentional Generative Adversarial Network to synthesize faces from structured descriptions and reported benchmark scores of 54.09 Freechet' s Inception Distance, 1.080 Facial Semantic Distance, and 60.42% Facial Semantic Similarity on our MAST dataset.
  • Software used: Keras, TensorFlow, PyTorch, OpenCV, Google Colab, Microsoft Cognitive Service, Angular Framework, MongoDB, NodeJS, Heroku, Overleaf.
August 2019 - July 2020

GRiD Flipkart Machine learning challenge for Large Scale Object Localization

Flipkart, Bangalore
  • Architected a ResNet-34 inspired model to perform object localization on Flipkart’s large and diverse items dataset.
  • Trained the model on images of size 128x96 (downscaled from VGA to a 1MP camera resolution)
  • Achieved an IoU score of 90.05% on the private test set.
  • Software used: Keras, TensorFlow, OpenCV, Google Colab.
January 2019 - March 2019

Skillset recommendation system to aid engineering aspirants in securing an Internship

Undergraduate researcher - Soft Computing Semester Project
  • Proposed a skill set recommender system to aid engineering aspirants in securing an Internship. Go to Project Page
  • Consolidated a small dataset of various skills an aspiring intern may have and categorized them as generic, company-specific, and domain-specific.
  • Modeled the skill selection problem as a combinatorial optimization problem with multiple objectives and employed a Genetic algorithm (GA) to solve it.
  • Established a Fitness Function to evaluate the fitness of each individual in the chromosome population.
  • Formulated an Objective Function to combine opposing goals of finding the best skillset while minimizing the time to achieve it.
  • Implemented a modular GA pipeline in C++ to evaluate and select the optimum set from the possible combinations of GA operations: population initialization, parent selection, crossover, mutation, survivor selection, and termination.
  • Software used: C++.
August 2018 - December 2018

Game Playing Agents - Tic Tac Toe AI

Undergraduate researcher - Course Project
  • Simulated adversarial games between AI agents on a 3x3 and a 4x4 Tic Tac Toe board.

    Go to Project Page

  • Observed 1-move lookahead provided the best tradeoff between win-draw-loss ratio and time to decide the optimum move, irrespective of boardsize.
  • Software used: Python.
October 2018 - November 2018

Book My Flight - Database Management Semester Project

Department of Computer Engineering, Netaji Subhas Insititute of Technology, an affiliate of Delhi University
  • Implemented a Flight Booking Management system for domestic flights in India.
  • Modeled a MySQL database system with a complex database trigger and recovery mechanism.
  • Designed Java-based user interface.

    Go to Project Page

  • Software used: Java, MySQL, ERDPlus.
August 2017 - December 2017

Selected Hackathon

Project ViSTARa - Reboot the Earth Hackathon

United Nations Technology Innovation Labs, India
  • Developed a web-based AI-powered Learning Management platform to educate rural women through the Self Help Group network in India and empower them to lead the way in green climate initiatives like sustainable agricultural practices. Coverage
August 2019

Peer Reviewed Publications

* Co-First Authors

† Corresponding Author


Awards & Honors


Blogs

Byte-size Information to Chew on

Paper Synopsis Blog Series

  • Synopsis: Multi-Attributed and Structured Text-to-Face Synthesis

    Go to Medium Article

  • Synopsis: Intelligent Monitoring of Stress Induced by Water Deficiency in Plants Using Deep Learning

    Go to Medium Article

  • VQ-GAN & Transformer — Taming Transformers for High-Resolution Image Synthesis: Synopsis

    Go to Medium Article

Project Blogs


Skills

Programming Languages
Research Tools and Frameworks
Development Tools and Frameworks
Cloud Platforms

Hobbies

  • Dissectologist Assembling jigsaw puzzles is a stress buster!
  • Travelling, Music and Food enthusiast
  • Capturing the world through my
  • Learning how to play the Piano

Contact Me

Email:
rohanwadhawan7[AT]gmail[DOT]com

LinkedIn:

GitHub: