Temporal action detection using a statistical language model A Richard, J Gall Proceedings of the IEEE conference on computer vision and pattern …, 2016 | 259 | 2016 |
Weakly supervised action learning with rnn based fine-to-coarse modeling A Richard, H Kuehne, J Gall Proceedings of the IEEE conference on Computer Vision and Pattern …, 2017 | 253 | 2017 |
When will you do what?-anticipating temporal occurrences of activities Y Abu Farha, A Richard, J Gall Proceedings of the IEEE conference on computer vision and pattern …, 2018 | 224 | 2018 |
Meshtalk: 3d face animation from speech using cross-modality disentanglement A Richard, M Zollhöfer, Y Wen, F De la Torre, Y Sheikh Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021 | 210 | 2021 |
Neuralnetwork-viterbi: A framework for weakly supervised video learning A Richard, H Kuehne, A Iqbal, J Gall Proceedings of the IEEE conference on Computer Vision and Pattern …, 2018 | 164 | 2018 |
Conditional diffusion probabilistic model for speech enhancement YJ Lu, ZQ Wang, S Watanabe, A Richard, C Yu, Y Tsao ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 160 | 2022 |
Weakly supervised learning of actions from transcripts H Kuehne, A Richard, J Gall Computer Vision and Image Understanding 163, 78-89, 2017 | 145 | 2017 |
Action sets: Weakly supervised action segmentation without ordering constraints A Richard, H Kuehne, J Gall Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2018 | 110 | 2018 |
A hybrid rnn-hmm approach for weakly supervised temporal action segmentation H Kuehne, A Richard, J Gall IEEE transactions on pattern analysis and machine intelligence 42 (4), 765-779, 2018 | 103 | 2018 |
Audio-and gaze-driven facial animation of codec avatars A Richard, C Lea, S Ma, J Gall, F De la Torre, Y Sheikh Proceedings of the IEEE/CVF winter conference on applications of computer …, 2021 | 89 | 2021 |
Mean-normalized stochastic gradient for large-scale deep learning S Wiesler, A Richard, R Schlüter, H Ney 2014 IEEE International Conference on Acoustics, Speech and Signal …, 2014 | 86 | 2014 |
Audiodec: An open-source streaming high-fidelity neural audio codec YC Wu, ID Gebru, D Markoviæ, A Richard ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 71 | 2023 |
Multiface: A dataset for neural face rendering C Wuu, N Zheng, S Ardisson, R Bali, D Belko, E Brockmeyer, L Evans, ... arXiv preprint arXiv:2207.11243, 2022 | 64 | 2022 |
Neural Synthesis of Binaural Speech From Mono Audio A Richard, D Markovic, ID Gebru, S Krenn, GA Butler, F Torre, Y Sheikh International Conference on Learning Representations, 2021 | 62 | 2021 |
RASR/NN: The RWTH neural network toolkit for speech recognition S Wiesler, A Richard, P Golik, R Schlüter, H Ney 2014 IEEE International Conference on Acoustics, Speech and Signal …, 2014 | 61 | 2014 |
A bag-of-words equivalent recurrent neural network for action recognition A Richard, J Gall Computer Vision and Image Understanding 156, 79-91, 2017 | 49 | 2017 |
Audio-visual speech codecs: Rethinking audio-visual speech enhancement by re-synthesis K Yang, D Markoviæ, S Krenn, V Agrawal, A Richard Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022 | 38 | 2022 |
Deep impulse responses: Estimating and parameterizing filters with deep networks A Richard, P Dodds, VK Ithapu ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 36 | 2022 |
Implicit hrtf modeling using temporal convolutional networks ID Gebru, D Markoviæ, A Richard, S Krenn, GA Butler, F De la Torre, ... ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 32 | 2021 |
From audio to photoreal embodiment: Synthesizing humans in conversations E Ng, J Romero, T Bagautdinov, S Bai, T Darrell, A Kanazawa, A Richard Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 25 | 2024 |