IISC BTIRC

Publications from the BTIRC project [2020-2024]

2024

Learning to Switch off, Switch on, and Integrate Modalities in Large Pretrained Transformers Tejas Duseja, Annervaz K. M, Jeevithiesh Duggani, Shyam Zacharia, Michael Free and Ambedkar Dukkipati IEEE International Conference on Multimedia Information Processing and Retrieval: 2024.

Kalluri, Shareef Babu, Prachi Singh, Pratik Roy Chowdhuri, Apoorva Kulkarni, Shikha Baghel, Pradyoth Hegde, Swapnil Sontakke, S. R. Prasanna, Deepu Vijayasenan, and Sriram Ganapathy. "The Second DISPLACE Challenge: DIarization of SPeaker and LAnguage in Conversational Environments." arXiv preprint arXiv:2406.09494 (2024). (Accepted at Interspeech 2024)

Baghel, Shikha, Shreyas Ramoji, Somil Jain, Pratik Roy Chowdhuri, Prachi Singh, Deepu Vijayasenan, and Sriram Ganapathy. "Summary of the DISPLACE challenge 2023-DIarization of SPeaker and LAnguage in Conversational Environments." Speech Communication 161 (2024): 103080.

Dutta, Soumya, and Sriram Ganapathy. "Zero Shot Audio To Audio Emotion Transfer With Speaker Disentanglement." In ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 10371-10375. IEEE, 2024.

Singh, Prachi, and Sriram Ganapathy. ""Overlap-aware End-to-End Supervised Hierarchical Graph Clustering for Speaker Diarization."" arXiv preprint arXiv:2401.12850 (2024) (under review TASLP).

2023

Singh, Prachi, Amrit Kaul, and Sriram Ganapathy. ""Supervised hierarchical clustering using graph neural networks for speaker diarization."" In ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 1-5. IEEE, 2023.

1 Sep 2022 - 31 Aug 2023

Conference papers

Srikanth Raj Chetupalli, Sriram Ganapathy, "Speaker conditioned acoustic modeling for multi-speaker conversational ASR", INTERSPEECH, September 2022.
Pavan K Gadamsetty, KVS Hari, A Fast Dictionary Learning Algorithm for CSI Feedback in Massive MIMO FDD Systems, 2023 National Conference on Communications (NCC), 1-6, Feb 2023
Prachi Singh, Amrit Kaul, and Sriram Ganapathy, “Supervised Hierarchical Clustering using Graph Neural Networks for Speaker Diarization,” in Proc. ICASSP, 2023.

Journal papers published

Pavan K Gadamsetty, KVS Hari, L Hanzo , Learning a Common Dictionary for CSI Feedback in FDD Massive MU-MIMO-OFDM Systems, IEEE Open Journal of Vehicular Technology, 2023
M Francis, F Mehran, KVS Hari, Analysis of Selective User Forwarding in Cell-Free Massive MIMO with Channel Aging, IEEE Access, 2023

Journal papers submitted

A. Mukhopadhyay, N. R. Talwar, H. Vishwakarma, G. S. R. Reddy, S. Srivastava, A. Pena-Rios, and P. Biswas, VR Digital Twin of Office Space with Computer Vision based Estimation of Room Occupancy and Power Consumption, Automation in Construction, Elsevier. [Under Review]
S. Patel, A. Mukhopadhyay, P. Sharma, G. S. R. Reddy, and P. Biswas, Data-Driven Digital Twin Construction: Real-Time Mapping for Accurate Replication, Virtual Reality and Intelligent Hardware, Elsevier. [Under Review]

2022

Srikanth Raj Chetupalli, Sriram Ganapathy, ""Speaker conditioned acoustic modeling for multi-speaker conversational ASR"", INTERSPEECH, September 2022.

1 Sep 2021 - 31 Aug 2022

Journal papers

Pavan K Gadamsetty, KVS Hari, L Hanzo , “Learning a Common Dictionary for CSI Feedback in FDD Massive MU-MIMO-OFDM Systems”, IEEE Open Journal of Vehicular Technology, 2023
M Francis, F Mehran, KVS Hari, “Analysis of Selective User Forwarding in Cell-Free Massive MIMO with Channel Aging”, IEEE Access, 2023
Rana Kumar Jana, Anand Srivastava, Andrew Lord, Abhijit Mitra, “Optical cable deployment versus fibre leasing: an operator’s perspective on Capex savings for capacity upgrade in an elastic optical core network”, Journal of Optical Communications and Networking, Volume 15, Issue , 2023/6/22
Rana Kumar Jana, Bijoy Chand Chatterjee, Abhishek Pratap Singh, Anand Srivastava, Biswanath Mukherjee, Andrew Lord, Abhijit Mitra, “Quality-aware resource provisioning for multiband elastic optical networks: a deep-learning-assisted approach”, Journal of Optical Communications and Networking, Volume 14, Issue 11 , 2022/11/1

Conference Papers

Srikanth Raj Chetupalli, Sriram Ganapathy, "Speaker conditioned acoustic modeling for multi-speaker conversational ASR", INTERSPEECH Conference, September 2022.
Pavan K Gadamsetty, KVS Hari, A Fast Dictionary Learning Algorithm for CSI Feedback in Massive MIMO FDD Systems, 2023 National Conference on Communications (NCC), 1-6, Feb 2023
Prachi Singh, Amrit Kaul, and Sriram Ganapathy, “Supervised Hierarchical Clustering using Graph Neural Networks for Speaker Diarization,” in Proc. ICASSP, 2023.
Effect of Fill Margin on Network Survivability for C+L Band Optical Networks” Rana Kumar Jana, Andrew Lord, Anand Srivastava, Abhijit Mitra, ECOC Conference, October 2023 in Glasgow, UK

2021

Singh, Prachi, and Sriram Ganapathy. ""Self-supervised metric learning with graph clustering for speaker diarization."" In 2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), pp. 90-97. IEEE, 2021.

Singh, Prachi, Rajat Varma, Venkat Krishnamohan, Srikanth Raj Chetupalli, and Sriram Ganapathy. ""LEAP submission for the third dihard diarization challenge."" in Proc Interspeech 2021.

P. Singh, S. Ganapathy, ""Self-supervised Representation Learning With Path Integral Clustering For Speaker Diarization,"" IEEE Transactions and Audio, Speech and Language Processing, 2021.

2020

Prachi Singh, Sriram Ganapathy, “Deep self-supervised hierarchical clustering for speaker diarization”, INTERSPEECH 2020.

Srikanth Raj Chetupalli, Sriram Ganapathy, “Context-Dependent RNNLM for Automatic Transcription of Conversations”, INTERSPEECH 2020.

A. Mukhopadhyay, GS Rajshekar Reddy, KPS Saluja, S. Ghosh, A. Peña-Rios, G. K. Gopal, P. Biswas, A Virtual Reality-Based Digital Twin of Office Spaces with Social Distance Measurement Feature, Virtual Reality & Intelligent Hardware 3 (5), Elsevier
A. Mukhopadhyay, GS Rajshekar Reddy, S. Ghosh, P. Biswas, Validating Social Distancing through Deep Learning and VR-Based Digital Twins , ACM VRST 2021
P. Singh and S. Ganapathy Deep Self-Supervised Hierarchical Clustering for Speaker Diarization, Interspeech 2020, Beijing, October 2020. - S. R. Chetupalli and S. Ganapathy, Context Dependent RNNLM for Automatic Transcription of Conversations", Interspeech 2020, Beijing, October 2020.

Conference papers submitted and published during 1 Sep 2020-31 Aug 2021

R. Hazra, P.Dutta, S. Gupta, M. A. Qaathir and A. Dukkipati, Active^2 Learning: Actively reducing redundancies in Active Learning methods for Sequence Tagging and Machine Translation. In Proceedings of NAACL: 2021.

Abhishek Mukhopadhyay, GS Rajshekar Reddy, Imon Mukherjee, Gokul Kumar Gopal, Anasol Peña-Rios, Pradipta Biswas. 2021. Generating Synthetic Data for Deep Learning using VR Digital Twin. In Proceedings of the 3rd International Conference on Virtual Reality and Image Processing (VRIP2021), Singapore.
BTIRC – IIITD papers Published (1 Sep 2019-31 Aug 2021)

A.Mitra, D. Semrau, N. Gahlawat, A. Pradhan, A. Srivastava, P. Bayvel, A. Lord, “Capacity Benefits over C+L Band Elastic Optical Nework in Indian Network Scenario”, Proc of IEEE Intl. Conf on ANTS, Dec 2019

A. Mitra, R. Jana, A. Pradhan, A. Srivastava, B. Mukherjee and A. Lord, “When is operation over C+L Bands more economical than multifiber for capacity upgrade of an optical backbone network?,”European Conference on Optical Communication (ECOC), Brussels, Belgium, December 2020,

T. Ahmed, S.Rahman, A.Pradhan, A.Mitra, M.Tornatore, A.Lord, B.Mukherjee, " C to C+L Bands Upgrade with Resource Re provisioning in Optical Backbone Networks," Optical Fiber Conference (OFC), June 2021, Online, paper 3560423.

Publications