Leveraging Transfer Learning for Voice Cloning in Bengali Language

EasyChair Preprint 15582, version history

VersionDatePagesVersion notes
1
December 16, 2024
14
2
January 10, 2025
14

An irrelavant sentence was included in the previous version unknowingly.

Keyphrases: AI-driven, Bengali, Bengali language, Mean Opinion Score, Mel Spectrograms, Mel-spectrogram, Speaker specific, Voice Cloning, Voice replication, acoustics speech and signal processing, automatic speech recognition and translation, bengali synthesis, cloning with a few samples, encoder synthesizer and vocoder, encoder synthesizer and vocoder components, large bengali speech recognition dataset, low resource languages like bengali, low-resource language, mozilla common voice bengali dataset, predicted vs target mel, recognition and translation for low resource, speaker encoder, speaker s unique voice features, speaker s voice, speaker text to speech synthesis, speaker verification to multi speaker, speech synthesis, text-to-speech, tts models and vocoder combinations, verification to multi speaker text, voice cloning for bengali, vs target mel spectrogram

BibTeX entry
BibTeX does not have the right entry for preprints. This is a hack for producing the correct reference:
@booklet{EasyChair:15582,
  author    = {T Taruneshwaran and Sanjay Chidambaram and Parthvi Manoj and S Sakthi Swaroopan and G Jyothish Lal},
  title     = {Leveraging Transfer Learning for Voice Cloning in Bengali Language},
  howpublished = {EasyChair Preprint 15582},
  year      = {EasyChair, 2025}}