Publications

(2019). Multimodal learning analytics: Society 5.0 project in Japan. Proc. 9th International Conference on Learning Analytics and Knowledge (LAK).

(2018). Visually grounded paraphrase extraction via phrase grounding. Proc. Workshop on Language and Vision at CVPR.

(2017). Fine-grained video retrieval for multi-clip video. Proc. Workshop on Closing the Loop Between Vision and Language (CLVL) at ICCV.

(2017). Video question answering to find a desired video segment. Proc. Open Knowledge Base and Question Answering Workshop (OKBQA) at SIGIR.

(2016). Video summarization using deep semantic features. Proc. 13th Asian Conference on Computer Vision (ACCV).

(2015). Textual description-based video summarization for video blogs. Proc. 2015 IEEE International Conference on Multimedia and Expo (ICME).

(2015). Facial expression preserving privacy protection using image melding. Proc. 2015 IEEE International Conference on Multimedia and Expo (ICME).

(2013). Inferring what the videographer wanted to capture. Proc. 2013 IEEE International Conference on Image Processing (ICIP).

(2012). Markov random field-based real-time detection of intentionally-captured persons. Proc. 19th IEEE International Conference on Image Processing (ICIP).

(2011). Extracting intentionally captured regions using point trajectories. Proc. 19th ACM International Conference on Multimedia (ACM MM).

(2011). Automatic generation of privacy-protected videos using background estimation. Proc. 2011 IEEE International Conference on Multimedia and Expo (ICME).

(2010). Discriminating intended human objects in consumer videos. Proc. 20th International Conference on Pattern Recognition (ICPR).

(2010). Digital Diorama: Sensing-based real-world visualization. Proc. International Conference on Information Processing and Management of Uncertainty in Knowledge-Based Systems (IPMU).

(2010). Detecting intended human objects in human-captured videos. Proc. 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops.

(2009). Digital Diorama: Real-time adaptive visualization of public spaces. Proc. 1st International Conference on Security Camera Network, Privacy Protection and Community Safety (SPC).

(2007). Maximum-likelihood estimation of recording position based on audio watermarking. Proc. Third International Conference on Intelligent Information Hiding and Multimedia Signal Processing (IIHMSP).

(2007). Determining recording location based on synchronization positions of audio watermarking. Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
