Posts tagged 'tacotron'

tacotron 1

Paper title: MELS-TTS: Multi-emotion Multi-lingual Multi-speaker Text-To-Speech system via disentangled style tokens (2024)
Paper authors: Heegin Choi, Jae-Sung Bae, Joun Yeop Lee, Seongkyu Mun, Jihwan Lee, Hoon-Young Cho, Chanwoo Kim

Overview
Speech synthesis (text-to-speech, TTS) is now built into many commercial software products and services, and to me its quality feels fairly close to human speech. I am still an undergraduate studying the field, but as for whether it can really convey emotion or express non-verbal elements the way a person does..

Deep Learning/Speech Synthesis 2024.06.13

moonai

An undergraduate student studying deep learning, with a particular interest in speech

