Adrià Recasens

Cited by

	All	Since 2019
Citations	3244	3031
h-index	20	19
i10-index	26	25

900

450

225

675

20162017201820192020202120222023202419 51 126 168 237 402 584 738 895

Public access

View all

5 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Antonio TorralbaProfessor of Computer Science, MITVerified email at csail.mit.edu
Aditya KhoslaCTO, Iterative HealthVerified email at iterative.health
Fredo DurandProfessor of Computer Science, MITVerified email at mit.edu
Zoya Bylinskii (Gavrilov)Research Scientist, AdobeVerified email at adobe.com
Aude OlivaSenior Research Scientist, CSAIL, MIT Director MIT-IBM Lab, MIT College Director IndustryVerified email at mit.edu
Carl VondrickAssociate Professor, Columbia UniversityVerified email at columbia.edu
Àgata LapedrizaUniversitat Oberta de Catalunya, Northeastern UniversityVerified email at uoc.edu
Jose M. AlvarezNVIDIAVerified email at nvidia.com
Ali Borjiindependent researcherVerified email at usc.edu
Ariadna QuattonidMetrics, USAVerified email at dmetrics.com
Sotirios KotsopoulosMassachusetts Institute Of TechnologyVerified email at mit.edu

Adrià Recasens

Research Scientist, DeepMind

Verified email at google.com - Homepage

Computer Vision


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Gemini: a family of highly capable multimodal models G Team, R Anil, S Borgeaud, Y Wu, JB Alayrac, J Yu, R Soricut, ... arXiv preprint arXiv:2312.11805, 2023	568	2023
Self-supervised multimodal versatile networks JB Alayrac, A Recasens, R Schneider, R Arandjelović, J Ramapuram, ... Advances in Neural Information Processing Systems 33, 25-37, 2020	374	2020
Gaze360: Physically unconstrained gaze estimation in the wild P Kellnhofer, A Recasens, S Stent, W Matusik, A Torralba Proceedings of the IEEE/CVF international conference on computer vision …, 2019	308	2019
Where are they looking? A Recasens Continente, A Khosla, C Vondrick, A Torralba Neural Information Processing Systems Foundation, 2015	265*	2015
Jointly discovering visual objects and spoken words from raw sensory input D Harwath, A Recasens, D Surís, G Chuang, A Torralba, J Glass Proceedings of the European conference on computer vision (ECCV), 649-665, 2018	227	2018
Emotion recognition in context R Kosti, JM Alvarez, A Recasens, A Lapedriza Proceedings of the IEEE conference on computer vision and pattern …, 2017	211	2017
Context based emotion recognition using emotic dataset R Kosti, JM Alvarez, A Recasens, A Lapedriza IEEE transactions on pattern analysis and machine intelligence 42 (11), 2755 …, 2019	199	2019
Where should saliency models look next? Z Bylinskii, A Recasens, A Borji, A Oliva, A Torralba, F Durand Computer Vision–ECCV 2016: 14th European Conference, Amsterdam, The …, 2016	186	2016
Learning to zoom: a saliency-based sampling layer for neural networks A Recasens, P Kellnhofer, S Stent, W Matusik, A Torralba Proceedings of the European conference on computer vision (ECCV), 51-66, 2018	153	2018
Broaden your views for self-supervised video learning A Recasens, P Luc, JB Alayrac, L Wang, F Strub, C Tallec, M Malinowski, ... Proceedings of the IEEE/CVF international conference on computer vision …, 2021	125	2021
Following gaze in video A Recasens, C Vondrick, A Khosla, A Torralba Proceedings of the IEEE International Conference on Computer Vision, 1435-1443, 2017	91	2017
Game Plan: What AI can do for Football, and What Football can do for AI K Tuyls, S Omidshafiei, P Muller, Z Wang, J Connor, D Hennes, I Graham, ... Journal of Artificial Intelligence Research 71, 41-88, 2021	84	2021
Emotic: Emotions in context dataset R Kosti, JM Alvarez, A Recasens, A Lapedriza Proceedings of the IEEE conference on computer vision and pattern …, 2017	70	2017
Towards learning universal audio representations L Wang, P Luc, Y Wu, A Recasens, L Smaira, A Brock, A Jaegle, ... ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022	64	2022
Tap-vid: A benchmark for tracking any point in a video C Doersch, A Gupta, L Markeeva, A Recasens, L Smaira, Y Aytar, ... Advances in Neural Information Processing Systems 35, 13610-13626, 2022	63	2022
Multimodal self-supervised learning of general audio representations L Wang, P Luc, A Recasens, JB Alayrac, A Oord arXiv preprint arXiv:2104.12807, 2021	44	2021
Understanding infographics through textual and visual tag prediction Z Bylinskii, S Alsheikh, S Madan, A Recasens, K Zhong, H Pfister, ... arXiv preprint arXiv:1709.09215, 2017	41	2017
Synthetically trained icon proposals for parsing and summarizing infographics S Madan, Z Bylinskii, M Tancik, A Recasens, K Zhong, S Alsheikh, ... arXiv preprint arXiv:1807.10441, 2018	25	2018
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context M Reid, N Savinov, D Teplyashin, D Lepikhin, T Lillicrap, J Alayrac, ... arXiv preprint arXiv:2403.05530, 2024	22	2024
Breaking microsoft’s CAPTCHA CP Karthik, RA Recasens Technical report, 2015	21	2015

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors