Rif A. Saurous
Rif A. Saurous
Verified email at google.com
TitleCited byYear
CNN architectures for large-scale audio classification
S Hershey, S Chaudhuri, DPW Ellis, JF Gemmeke, A Jansen, RC Moore, ...
2017 ieee international conference on acoustics, speech and signal …, 2017
4082017
Tacotron: Towards end-to-end speech synthesis
Y Wang, RJ Skerry-Ryan, D Stanton, Y Wu, RJ Weiss, N Jaitly, Z Yang, ...
arXiv preprint arXiv:1703.10135, 2017
365*2017
Natural tts synthesis by conditioning wavenet on mel spectrogram predictions
J Shen, R Pang, RJ Weiss, M Schuster, N Jaitly, Z Yang, Z Chen, Y Zhang, ...
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
2922018
Deep probabilistic programming
D Tran, MD Hoffman, RA Saurous, E Brevdo, K Murphy, DM Blei
arXiv preprint arXiv:1701.03757, 2017
1052017
Fixing a broken ELBO
AA Alemi, B Poole, I Fischer, JV Dillon, RA Saurous, K Murphy
arXiv preprint arXiv:1711.00464, 2017
932017
Style tokens: Unsupervised style modeling, control and transfer in end-to-end speech synthesis
Y Wang, D Stanton, Y Zhang, RJ Skerry-Ryan, E Battenberg, J Shor, ...
arXiv preprint arXiv:1803.09017, 2018
782018
Towards end-to-end prosody transfer for expressive speech synthesis with tacotron
RJ Skerry-Ryan, E Battenberg, Y Xiao, Y Wang, D Stanton, J Shor, ...
arXiv preprint arXiv:1803.09047, 2018
722018
Tensorflow distributions
JV Dillon, I Langmore, D Tran, E Brevdo, S Vasudevan, D Moore, B Patton, ...
arXiv preprint arXiv:1711.10604, 2017
422017
Unsupervised learning of semantic audio representations
A Jansen, M Plakal, R Pandya, DPW Ellis, S Hershey, J Liu, RC Moore, ...
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
312018
Voicefilter: Targeted voice separation by speaker-conditioned spectrogram masking
Q Wang, H Muckenhirn, K Wilson, P Sridhar, Z Wu, J Hershey, ...
arXiv preprint arXiv:1810.04826, 2018
292018
Trainable frontend for robust and far-field keyword spotting
Y Wang, P Getreuer, T Hughes, RF Lyon, RA Saurous
2017 IEEE International Conference on Acoustics, Speech and Signal …, 2017
282017
Scalable learning of non-decomposable objectives
EET Eban, M Schain, A Mackey, A Gordon, RA Saurous, G Elidan
arXiv preprint arXiv:1608.04802, 2016
252016
An information-theoretic analysis of deep latent-variable models
A Alemi, B Poole, I Fischer, J Dillon, RA Saurus, K Murphy
242018
Simple, distributed, and accelerated probabilistic programming
D Tran, MW Hoffman, D Moore, C Suter, S Vasudevan, A Radul
Advances in Neural Information Processing Systems, 7598-7609, 2018
142018
Uncovering latent style factors for expressive speech synthesis
Y Wang, RJ Skerry-Ryan, Y Xiao, D Stanton, J Shor, E Battenberg, ...
arXiv preprint arXiv:1711.00520, 2017
132017
Exploring tradeoffs in models for low-latency speech enhancement
K Wilson, M Chinen, J Thorpe, B Patton, J Hershey, RA Saurous, ...
2018 16th International Workshop on Acoustic Signal Enhancement (IWAENC …, 2018
102018
Neumann optimizer: A practical optimization algorithm for deep neural networks
S Krishnan, Y Xiao, RA Saurous
arXiv preprint arXiv:1712.03298, 2017
72017
Towards learning semantic audio representations from unlabeled data
A Jansen, M Plakal, R Pandya, DPW Ellis, S Hershey, J Liu, RC Moore, ...
signal 2 (3), 7-11, 2017
72017
On using backpropagation for speech texture generation and voice conversion
J Chorowski, RJ Weiss, RA Saurous, S Bengio
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
62018
Ultrasonic communication using consumer hardware
P Getreuer, C Gnegy, RF Lyon, RA Saurous
IEEE Transactions on Multimedia 20 (6), 1277-1290, 2017
52017
The system can't perform the operation now. Try again later.
Articles 1–20