Jing Yu Koh

I am a first-year PhD student in the Machine Learning Department in the School of Computer Science at Carnegie Mellon University. I work on grounded language understanding and multi-modal learning.

Prior to this, I was a Research Engineer at Google Research, where I worked on vision-and-language problems and multi-modal generative models.

Before joining Google, I completed my undergraduate studies at the Singapore University of Technology and Design, graduating summa cum laude (highest honors) in 2019.

My first name is "Jing Yu" and informally I go by the nickname "JY".

News


  • (July 2022) After 2.73 wonderful years at Google, I've left to pursue my PhD at Carnegie Mellon University!
  • (January 2022) 1 paper accepted to ICLR 2022!
  • (December 2021) Serving as a reviewer for CVPR 2022.
  • (July 2021) 1 paper accepted to ICCV 2021!
  • (July 2021) Presenting an invited talk at Microsoft Research.
  • (July 2021) Serving as a reviewer for NeurIPS 2021.
  • (March 2021) 1 paper accepted to CVPR 2021!
  • (January 2021) 1 paper accepted to ICLR 2021!
  • (October 2020) 1 paper accepted to WACV 2021!
  • (July 2020) 1 paper accepted to ECCV 2020!
  • (October 2019) Officially joined Google Research in Mountain View, California.

Selected Publications [Google Scholar]


2022

Scaling Autoregressive Models for Content-Rich Text-to-Image Generation

Jiahui Yu, Yuanzhong Xu, Jing Yu Koh, Thang Luong, Gunjan Baid, Zirui Wang, Vijay Vasudevan, Alexander Ku, Yinfei Yang, Burcu Karagol Ayan, Ben Hutchinson, Wei Han, Zarana Parekh, Xin Li, Han Zhang, Jason Baldridge, Yonghui Wu

In submission, 2022.

Simple and Effective Synthesis of Indoor 3D Scenes

Jing Yu Koh*, Harsh Agrawal*, Dhruv Batra, Richard Tucker, Austin Waters, Honglak Lee, Yinfei Yang, Jason Baldridge, Peter Anderson (* denotes equal contribution)

In submission, 2022.

2021

Pathdreamer: A World Model for Indoor Navigation

Jing Yu Koh, Honglak Lee, Yinfei Yang, Jason Baldridge, Peter Anderson

In The International Conference on Computer Vision (ICCV), 2021.

Vector-quantized Image Modeling with Improved VQGAN

Jiahui Yu, Xin Li, Jing Yu Koh, Han Zhang, Ruoming Pang, James Qin, Alexander Ku, Yuanzhong Xu, Jason Baldridge, Yonghui Wu

In The International Conference on Learning Representations (ICLR), 2022.

Cross-Modal Contrastive Learning for Text-to-Image Generation

Han Zhang*, Jing Yu Koh*, Jason Baldridge, Honglak Lee, Yinfei Yang (* denotes equal contribution)

In The Conference on Computer Vision and Pattern Recognition (CVPR), 2021.

Text-to-Image Generation Grounded by Fine-Grained User Attention

Jing Yu Koh, Jason Baldridge, Honglak Lee, Yinfei Yang

In The IEEE Winter Conference on Applications of Computer Vision (WACV), 2021.

2020

SideInfNet: A Deep Neural Network for Semi-Automatic Semantic Segmentation with Side Information

Jing Yu Koh, Duc Thanh Nguyen, Quang-Trung Truong, Sai-Kit Yeung, Alexander Binder

In The European Conference on Computer Vision (ECCV), 2020.


Projects


ModelZoo

Model Zoo curates pre-trained deep learning models and code, making it easy for researchers to find models for various frameworks.

Web Application

Yu Sheng

The CNY Yusheng app is a reference app designed to help you learn more about unique Chinese New Year traditions and the yusheng dish.

iOS Application

Pixel Warrior

Castle-defense strategy game for iOS. Permanent death and level randomisation ensure that no two games are alike.

iOS Game