Shin'ichi Satoh's Lab

JST ASPIRE Program Logo

Welcome to Shin'ichi Satoh's Lab homepage at NII!

Research in our lab focuses on multimedia understanding and knowledge discovery. Especially, we aim to create an intelligent computer system which can see and understand the visual world.

We accept graduate students from Department of Information and Communication Engineering, Graduate School of Information Science and Technology, the University of Tokyo. Our lab is in National Institute of Informatics, Japan.


News

  • [2025.11.30] One paper was accepted by IEEE Access. Congrats Zhaohui!
  • [2025.11.21] One paper was accepted by MTAP. Congrats Yidan!
  • [2025.11.11] One paper was accepted by WACV. Congrats Daniel!
  • [2025.8.21] One paper was accepted by EMNLP. Thanks Shun, Takuto, Toshiki!
  • [2025.7.1] One paper was accepted by IEEE TCSVT. Congrats Jun-Xiu!
  • [2025.5.1] One paper was accepted by ICML. Congrats Zhijing!
  • [2025.4.4] One paper was accepted by IEEE Access. Congrats Zhaohui!
  • [2025.1.1] One paper was accepted by IEEE TMI. Congrats Yansheng!

Research projects

Large-scale fast object detection

We extended R-CNN to larger scale, which enables immediate and accurate object category detection from a large image databas. R. Hinami and S. Satoh, "Large-scale R-CNN with Classifier Adaptive Quantization", ECCV 2016

Multimedia Analytics

Explore, analyze, and visualize archives of multimedia content by bringing together data science and computer vision for the support of real world applications such as social sciences, media studies, and even marketing.

Temporal Matching Kernel with Explicit Feature Maps for Video Event Retrieval

We propose a new video representation for video event retrieval. Given a video query, the method is able to efficiently retrieve similar video events or near-duplicates along with a precise temporal alignment. ``Temporal matching kernel with explicit feature maps,'' ACM Multimedia 2015.

Video Event Detection by Exploiting Word Dependencies

We exploited word dependencies as a new semantic video representation for recognizing complex events S. Phan, Y. Miyao, D-D Le and S Satoh, "Video Event Detection by Exploiting Word Dependencies from Image Captions", COLING 2016