Ronghang Hu

Cited by

	All	Since 2019
Citations	6559	5726
h-index	22	20
i10-index	25	24

1700

850

425

1275

201520162017201820192020202120222023202430 96 239 404 632 802 877 1067 1631 715

Public access

View all

12 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Trevor DarrellProfessor of Computer Science, U.C. BerkeleyVerified email at eecs.berkeley.edu
Marcus RohrbachProfessor for Multimodal Reliable AI, TU Darmstadt, GermanyVerified email at tu-darmstadt.de
Kate SaenkoBoston UniversityVerified email at bu.edu
Jacob AndreasMITVerified email at mit.edu
Anna RohrbachProfessor, TU Darmstadt, GermanyVerified email at tu-darmstadt.de
Amanpreet SinghContextual AIVerified email at contextual.ai
Xinlei ChenFAIR, MetaVerified email at meta.com
Daniel FriedCarnegie Mellon UniversityVerified email at cs.cmu.edu
Ross GirshickResearch Scientist, Allen Institute for Artificial Intelligence (AI2)Verified email at allenai.org
Kaiming HeAssociate Professor, EECS, MITVerified email at mit.edu
Judy HoffmanAssistant Professor, Georgia TechVerified email at gatech.edu
Saining XieAssistant Professor at the Courant Institute, New York UniversityVerified email at nyu.edu
Shoubhik DebnathFAIR, AI at MetaVerified email at fb.com
Lisa Anne M HendricksDeepMindVerified email at google.com
Zeynep AkataProfessor at TUM and Director at Helmholtz MunichVerified email at helmholtz-munich.de
Jiashi FengByteDance Inc.Verified email at bytedance.com
Huazhe XuTsinghua UniversityVerified email at berkeley.edu
Bernt SchieleProfessor, Max Planck Institute for Informatics, Saarland Informatics Campus, Saarland UniversityVerified email at mpi-inf.mpg.de
Volkan CirikASAPPVerified email at asapp.com
Louis-Philippe MorencyAssociate professor, Carnegie Mellon UniversityVerified email at cs.cmu.edu

Ronghang Hu

Research Scientist, Meta AI

Verified email at meta.com - Homepage

Computer Vision Natural Language Processing Machine Learning


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Learning to reason: End-to-end module networks for visual question answering R Hu, J Andreas, M Rohrbach, T Darrell, K Saenko Proceedings of the IEEE international conference on computer vision, 804-813, 2017	664	2017
Natural language object retrieval R Hu, H Xu, M Rohrbach, J Feng, K Saenko, T Darrell Proceedings of the IEEE conference on computer vision and pattern …, 2016	620	2016
Grounding of textual phrases in images by reconstruction A Rohrbach, M Rohrbach, R Hu, T Darrell, B Schiele Computer Vision–ECCV 2016: 14th European Conference, Amsterdam, The …, 2016	533	2016
Speaker-follower models for vision-and-language navigation D Fried, R Hu, V Cirik, A Rohrbach, J Andreas, LP Morency, ... Advances in neural information processing systems 31, 2018	477	2018
Flava: A foundational language and vision alignment model A Singh, R Hu, V Goswami, G Couairon, W Galuba, M Rohrbach, D Kiela Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022	461	2022
Modeling relationships in referential expressions with compositional modular networks R Hu, M Rohrbach, J Andreas, T Darrell, K Saenko Proceedings of the IEEE conference on computer vision and pattern …, 2017	404	2017
Segmentation from natural language expressions R Hu, M Rohrbach, T Darrell Computer Vision–ECCV 2016: 14th European Conference, Amsterdam, The …, 2016	392	2016
LSDA: Large scale detection through adaptation J Hoffman, S Guadarrama, ES Tzeng, R Hu, J Donahue, R Girshick, ... Advances in neural information processing systems 27, 2014	378	2014
Convnext v2: Co-designing and scaling convnets with masked autoencoders S Woo, S Debnath, R Hu, X Chen, Z Liu, IS Kweon, S Xie Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023	352	2023
Learning to segment every thing R Hu, P Dollár, K He, T Darrell, R Girshick Proceedings of the IEEE conference on computer vision and pattern …, 2018	345	2018
UniT: Multimodal Multitask Learning with a Unified Transformer R Hu, A Singh arXiv preprint arXiv:2102.10772, 2021	322	2021
Textcaps: a dataset for image captioning with reading comprehension O Sidorov, R Hu, M Rohrbach, A Singh Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23 …, 2020	249	2020
Grounding visual explanations L Anne Hendricks, R Hu, T Darrell, Z Akata Proceedings of the European Conference on Computer Vision (ECCV), 264-279, 2018	226	2018
Iterative answer prediction with pointer-augmented multimodal transformers for textvqa R Hu, A Singh, T Darrell, M Rohrbach Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2020	211	2020
Explainable neural computation via stack neural module networks R Hu, J Andreas, T Darrell, K Saenko Proceedings of the European conference on computer vision (ECCV), 53-69, 2018	211	2018
Language-conditioned graph networks for relational reasoning R Hu, A Rohrbach, T Darrell, K Saenko Proceedings of the IEEE/CVF international conference on computer vision …, 2019	178	2019
Scaling language-image pre-training via masking Y Li, H Fan, R Hu, C Feichtenhofer, K He Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023	173	2023
Generating counterfactual explanations with natural language LA Hendricks, R Hu, T Darrell, Z Akata arXiv preprint arXiv:1806.09809, 2018	104	2018
Are You Looking? Grounding to Multiple Modalities in Vision-and-Language Navigation R Hu, D Fried, A Rohrbach, D Klein, T Darrell, K Saenko arXiv preprint arXiv:1906.00347, 2019	85	2019
Worldsheet: Wrapping the world in a 3d sheet for view synthesis from a single image R Hu, N Ravi, AC Berg, D Pathak Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021	69	2021

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors