Portfolio

Multi-Modal Domain Fusion for Multi-Modal Aerial View Object Classification

Our solution to the CVPR Perception Beyond Visible Spectrum Workshop 2022: Multi-Modal Aerial View Object Classification challenge. We proposed a novel Multi-Modal Domain Fusion(MDF) network and a training scheme to learn the domain invariant features from long-tailed multi-modal data distribution by reducing the domain discrepancy using Wasserstein distance and use it to accurately classify the aerially viewed objects. To make the learning more efficient, class-conditional domain adaptation was achieved by passing the multi-modal data in pairs at a time to the two models in MDF, effectively making it a shared feature learner.
Sumanth V Udupa, Aniruddh Sikdar, Suresh Sundaram
paper code

Deep Learning Based Localization and Control of Indoor Robots

	Tackled the perception and control of an autonomous robot working in an indoor environment. Report code

Estimation and Control of swarm of robots in an abstract space

	Implemented obstacle avoidance along with a simple formation using a simple control law to depict swarm behavior of robots using ROS Melodic and Gazebo(3D Open Source Simulator). Report code video

Robotic Process Automation using Swarm Behaviour

	Coordinated multiple robots for precise localization and navigation using ROS Navigation Stack, orchestrating seamless pick-and-place operations, and implemented accurate environmental mapping using Google Cartographer. Report code

QR code scanning Quadcopter

	Transmission of live video feed from raspberry pi to another processor via TCP/IP using socket library in Python for IIT Bombay Drone Challenge. code