About Me

Hi! I am Shijia Yang, a second-year Master's student in Computer Science at Stanford University. I was advised by Professor Kurt Keutzer at BAIR Lab and Professor Masayoshi Tomizuka at MSC Lab.
My goal is to extend human intelligence and physical power beyond the limit. Therefore, I hope to build a robust and intelligent robotic perception system. My specific interest is to explore methods that properly and efficiently utilizing multimodal data, including image, language, point-cloud, and high dimension variant of them, to create realiable representations for machine perception. I also love to explore other interesting directions.
News: I am currently researching on vision-language alignment. Actively looking for communication and collaboration!

Publications

Law of Vision Representation in MLLMs

Shijia Yang, Bohan Zhai, Quanzeng You, Jianbo Yuan, Hongxia Yang, Chenfeng Xu.

HallE-Control: Controlling Object Hallucination in Large Vision Language Models

Shijia Yang*, Bohan Zhai*, Chenfeng Xu, Sheng Shen, Kurt Keutzer, Manling Li.

Multitask Vision-Language Prompt Tuning

Shijia Yang*, Sheng Shen*, Tianjun Zhang*, Bohan Zhai, Joseph E. Gonzalez, Kurt Keutzer, Trevor Darrell.

WACV 2024.


Time Will Tell: New Outlooks and A Baseline for Temporal Multi-view 3D Object Detection

Jinhyung Park*, Chenfeng Xu*, Shijia Yang, Kurt Keutzer, Kris Kitani, Masayoshi Tomizuka, Wei Zhan.

ICLR 2023 Oral.

Image2Point: 3D Point-Cloud Understanding with 2D Image Pretrained Models

Shijia Yang*, Chenfeng Xu*, Tomer Galanti, Bichen Wu, Xiangyu Yue, Bohan Zhai, Wei Zhan, Peter Vajda, Kurt Keutzer, Masayoshi Tomizuka.

ECCV 2022.

Curriculum Vitae