Yuta Nakashima is an associate professor with Institute for Datability Science, Osaka University. His research interests include computer vision, pattern recognition, natural langauge processing, and their applications.
PhD in Engineering, 2012
Video summarization has been one of research topics that require deep understanding of video content. We explore various methods for automatic video summarization and also the limitation of current datasets.
Visual question answering (VQA) with knowledge is a task that requires knowledge to answer questions on images/video. This additional requirement of knowledge poses an interesting challenge on top of the classic VQA tasks.