Multimedia content analysis and synthesis is at the core of our research, with a particular focus on visual and text modality. These are some of the current ongoing projects and thesis directions.
Multimodal Skill Assessment
- Communication and Presentation skill assessment using multimodal analysis (Sowmya Rasipuram, completed and Chinchu Thomas PhD thesis, completed)
- Handwritten Essay grading (Annapurna PhD thesis, completed)
- AI for Social Psychology (Kumar Shubham MS thesis, Rahil iMTech Thesis completed)
- Dancing skill assessment (Pooja Venkatesh MS thesis, completed)
- Autism Behavior Assessment (MINRO Funding, Jeba Berlin, Sarthak iMTech Thesis) [CURRENT]
Multimodal Conversational Systems:
- Follow up question generating interviewing Agent (Pooja Rao MS, PhD Thesis)
- Margadarshi (UPSC interviewing agent, internally funded project, Laxmi Narayen RA) [CURRENT]
- Interviewing agents (Jinal MS Thesis) [CURRENT]
GANs and Multimedia Synthesis:
- Controlled behavioral video generation (Kumar Shubham MS Thesis, Swasti iMTech Thesis, Anirban MS Thesis) [CURRENT]
- ISL Gesture Synthesis [Shyam MS Thesis, completed]