Home | People | Research | Publications | Downloads & Demos | Links
Jump to 2017 | 2016 | 2015 | 2014 | 2013 | 2012 | 2011 | 2010 | 2009 | 2008 | 2007 | 2006 | 2005 | 2004 | 2003 | 2002 | 2001 | 2000 | 1999 | 1998 and earlier

2017


  • Cross-modal Recipe Retrieval: How to Cook This Dish ?
    J. J. Chen, Lei Pang, C. W. Ngo, International Conference on Multimedia Modeling (MMM), Reykjavik, Iceland, January 2017. (Oral)

  • Concept-Based Interactive Search System
    Y. J. Lu, P. A. Nguyen, H. Zhang, C. W. Ngo, International Conference on Multimedia Modeling (MMM), Reykjavik, Iceland, January 2017.

2016


  • Hyperlink-Aware Object Retrieval
    W. Zhang, C. W. Ngo, X. C. Cao, IEEE Trans. on Image Processing (TIP), to appear, 2016.

  • Automatic Hookworm Detection in Wireless Capsule Endoscopy Images
    X. Wu, H. H. Chen, T. Gan, J. Z. Chen, C. W. Ngo, Q. Peng, IEEE Trans. on Medical Imaging, vol. 35, no. 7, pp. 1741-1752, 2016.

  • Detection of Bird Nests in Overhead Catenary System Images for High-Speed Rail
    X. Wu, P. Yuan, Q. Peng, C. W. Ngo, J. Y. He, Pattern Recognition (PR), vol. 51, pp. 242-254, 2016.

  • Fast Covariant VLAD for Image Search
    W. L. Zhao, C. W. Ngo, H. Z. Wang, IEEE Trans. on Multimedia (TMM), to appear, 2016.

  • On the use of commonsense ontology for multimedia event recounting
    C. C. Tan, C. W. Ngo, International Journal of Multimedia Information Retrieval (IJMIR), vol. 5, no. 2, pp. 73-88, 2016.

  • Object Pooling for Multimedia Event Detection and Evidence Localization
    H. Zhang, C. W. Ngo, ITE Trans. on Media Technology and Applications, vol. 4, no. 3, pp. 218-226, 2016.

  • Deep-based Ingredient Recognition for Cooking Recipe Retrieval
    J. J. Chen, C. W. Ngo, ACM Multimedia (ACM MM), Amsterdam, Netherlands, October 2016. (Ora) [VIREO Food-172]

  • Event Detection with Zero Example: Select the Right and Suppress the Wrong Concepts
    Y. J. Lu, H. Zhang, M. D. Boer, C. W. Ngo, ACM International Conference on Multimedia Retrieval (ICMR), New York, USA, June 2016. (Oral)

  • Serendipity-driven Celebrity Video Hyperlinking
    S. J. Yang, L. Pang, C. W. Ngo, ACM International Conference on Multimedia Retrieval (ICMR), New York, USA, June 2016. (Demo)

  • VIREO @ TRECVID 2016: Multimedia Event Detection, Ad-hoc Video Search, Video-to-Text Description
    H. Zhang, L. Pang, Y. J. Lu, C. W. Ngo, NIST TRECVID Workshop (TRECVID'16), Gaithersburg, USA, Nov 2016.

2015


  • Human Action Recognition in Unconstrained Videos by Explicit Motion Modeling
    Y. G. Jiang, Q. Dai, W. Liu, X. Y. Xue, C. W. Ngo, IEEE Trans. on Image Processing (TIP), vol. 24, no. 11, pp. 3781-3795, 2015.

  • Topological Spatial Verification for Instance Search
    W. Zhang, C. W. Ngo, IEEE Trans. on Multimedia (TMM), vol. 17, no. 8, pp. 1236-1247, 2015.

  • Deep Multimodal Learning for Affective Analysis and Retrieval
    L. Pang, Shiai Zhu, C. W. Ngo, IEEE Trans. on Multimedia (TMM), vol. 17, no. 11, pp. 2008-2020, 2015

  • Opinion Question Answering by Sentiment Clip Localization
    L. Pang, C. W. Ngo, ACM Trans. on Multimedia Computing, Communications, and Applications, vol. 12, no. 2, pp. 31:1-31:19, 2015.

  • Unsupervised Celebrity Face Naming in Web Videos
    L. Pang, C. W. Ngo, IEEE Trans. on Multimedia (TMM), vol. 17, no. 6, pp. 854-866, 2015.

  • Learning Query and Image Similarities with Ranking Canonical Correlation Analysis
    T. Yao, T. Mei, C. W. Ngo, IEEE International Conference on Computer Vision (ICCV), Santiago, Chile, December 2015. (Oral)

  • Semi-supervised Hashing with Semantic Confidence for Large Scale Visual Search
    Y. W. Pan, T. Yao, H. Q. Li, C. W. Ngo, T. Mei, ACM Conference on Research and Development in Information Retrieval (SIGIR), Santiago, Chile, August 2015. (Oral)

  • Semi-supervised Domain Adaptation with Subspace Learning for Visual Recognition
    T. Yao, Y. W. Pan, C. W. Ngo, H. Q. Li, T. Mei, Computer Vision and Pattern Recognition (CVPR), Boston, USA, June 2015.

  • Multimodal Learning with Deep Boltzmann Machine for Emotion Prediction in User Generated Videos
    L. Pang, C. W. Ngo, ACM International Conference on Multimedia Retrieval (ICMR), Shanghai, China, June 2015.

  • Improving Automatic Name-Face Association using Celebrity Images on the Web
    Z. N. Chen, B. L. Feng, C. W. Ngo, C. Y. Jia, X. S. Huang, ACM International Conference on Multimedia Retrieval (ICMR), Shanghai, China, June 2015.

  • VIREO-TNO @ TRECVID 2015: Multimedia Event Detection
    H. Zhang, Y. J. Lu, M. D. Boer, F. T. Haar, Z. F. Qiu, K. Schutte, W. Kraaij, C. W. Ngo, NIST TRECVID Workshop (TRECVID'15), Gaithersburg, USA, Nov 2015.

  • VIREO @ TRECVID 2015: Video Hyperlinking (LNK)
    L. Pang, C. W. Ngo, NIST TRECVID Workshop (TRECVID'15), Gaithersburg, USA, Nov 2015.

2014


  • Visual Typo Correction by Collocative Optimization - A Case Study on Merchandize Images
    X. Y. Wei, Z. Q. Yang, C. W. Ngo, W. Zhang, IEEE Trans. on Image Processing (TIP), vol. 23, no. 2, pp. 527-540, February 2014.

  • Video Event Detection Using Motion Relativity and Feature Selection
    F. Wang, Z. H. Sun, Y. G. Jiang and C. W. Ngo, IEEE Trans. on Multimedia (TMM), vol. 16, no. 5, pp. 1303-1315, 2014.

  • Collaborative Error Reduction for Hierarchical Classification
    S. A. Zhu, X. Y. Wei and C. W. Ngo, Elsevier Journal. on Computer Vision and Image Understanding, vol. 124, pp. 79-90, 2014.

  • Placing Videos on a Semantic Hierarchy for Search Result Navigation
    S. Tan, Y. G. Jiang and C. W. Ngo, ACM Trans. on Multimedia Computing, Communications, and Applications, vol. 10, no. 4, 2014.

  • A Hamming Embedding Kernel with Informative Bag-of-Visual-Words for Video Semantic Indexing
    F. Wang, W. L. Zhao, C. W. Ngo and B. Merialdo, ACM Trans. on Multimedia Computing, Communications, and Applications, vol. 10, no. 3, 2014.

  • Click-boosting Multi-modality Graph-based Reranking for Image Search
    X. Yang, Y. Zhang, T. Yao, C. W. Ngo and T. Mei, Multimedia Systems, vol. 21, no. 2, pp. 217-227, 2014.

  • Name-Face Association in Web Videos: A Large-Scale Dataset, Baselines, and Open Issues
    Z. N. Chen, C. W. Ngo, W. Zhang, J. Cao and Y. G. Jiang, Journal of Computer Science and Technology, vol. 29, no. 5, pp. 785-798, 2014.

  • Click-through-based Cross-view Learning for Image Search
    Y. Pan, T. Yao, T. Mei, H. Li, C. W. Ngo and Y. Rui, ACM Conference on Research and Development in Information Retrieval (SIGIR), Gold Coast, Australia, July 2014. (Oral)

  • Scalable Visual Instance Mining with Threads of Features
    W. Zhang, H. Li, C. W. Ngo and S. F. Chang, ACM Multimedia (ACM MM), Orlando, Florida, USA, 2014. (Oral)

  • Organizing Video Search Results to Adapted Semantic Hierarchies for Topic-based Browsing
    J. Wang, Y. G. Jiang, Q. Wang, K. Yang, C. W. Ngo, ACM Multimedia (ACM MM), Orlando, USA, 2014.

  • Click-through-based Subspace Learning for Image Search
    Y. Pan, T. Yao, X. Tian, H. Li and C. W. Ngo, ACM Multimedia (ACM MM), Orlando, USA, 2014. (Multimedia Grand Challenge)

  • CeleBrowser: An example of browsing big data on small device
    S. Tan, C. W. Ngo, J. Xu and Y. Rui, ACM International Conference on Multimedia Retrieval (ICMR), Glasgow, UK, April 2014. (Demo)

  • CeleLabel: An Interactive System for Annotating Celebrities in Web Videos
    Z. N. Chen, J. F. Bai, C. W. Ngo, B. L. Feng and B. Xu, ACM Multimedia (ACM MM), Orlando, USA, 2014. (Demo)

  • VIREO-TNO @ TRECVID 2014: Multimedia Event Detection and Recounting (MED and MER)
    C. W. Ngo, Y. J. Lu, H. Zhang, T. Yao, C. C. Tan, L. Pang, Maaike de Boer, John Schavemaker, Klamer Schutte and Wessel Kraaij, NIST TRECVID Workshop (TRECVID'14), Orlando, USA, 2014.

  • VIREO @ TRECVID 2014: Instance Search and Semantic Indexing
    W. Zhang, H. Zhang, T. Yao, Y. J. Lu, J. J. Chen, C. W. Ngo, NIST TRECVID Workshop (TRECVID'14), Orlando, USA, 2014.

2013


  • Flip-Invariant SIFT for Copy and Object Detection
    W. L. Zhao and C. W. Ngo, IEEE Trans. on Image Processing, vol. 22, no. 3, pp. 980-991, 2013.

  • Circular Reranking for Visual Search
    T. Yao, C. W. Ngo and T. Mei, IEEE Trans. on Image Processing, vol. 22, no. 4, pp. 1644-1655, 2013.

  • Unified Entity Search in Social Media Community
    T. Yao, Y. Liu, C. W. Ngo and T. Mei, Proc. of International World Wide Web Conference (WWW), Rio de Janeiro, Brazil, May 2013. (Oral)

  • Annotation for Free: Video Tagging by Mining User Search Behavior
    T. Yao, T. Mei, C. W. Ngo and S. P. Li, ACM Multimedia (ACM MM), Barcelona, Catalunya, Spain, October 2013. (Oral)

  • Error Recovered Hierarchical Classication
    S. A. Zhu, X. Y. Wei and C. W. Ngo, ACM Multimedia (ACM MM), Barcelona, Catalunya, Spain, October 2013.

  • Image Search by Graph-based Label Propagation with Image Representation from DNN
    Y. W. Pan, T. Yao, K. Y. Yang, H. Q. Li, C. W. Ngo, J. D. Wang and T. Mei, ACM Multimedia (ACM MM), Barcelona, Catalunya, Spain, October 2013. (Multimedia Grand Challenge)

  • Near-Duplicate Video Retrieval: Current Research and Future Trends
    J. J. Liu, Z. Huang, H. Y. Cai, H. T. Shen, C. W. Ngo and W. Wang, ACM Computing Surveys, 45(4), 2013.

  • Searching Visual Instances with Topology Checking and Context Modeling
    W. Zhang and C. W. Ngo, ACM International Conference on Multimedia Retrieval (ICMR), Dallas, USA, April, 2013. (Oral)

  • Video Concept Detection by Learning from Web Images: A Case Study on Cross Domain Learning
    S. A. Zhu, T. Yao and C. W. Ngo, ICME Workshop on Media Fragment Creation and reMIXing (MMIX'13), San Jose, California, USA, July 2013.

  • Click-boosting Random Walk for Image Search Reranking
    X. P. Yang, Y. D. Zhang, T. Yao, Z.-J. Zha and C. W. Ngo, ICIMCS, Huangshan, Anhui, China, Aug. 2013. (Best Paper Award)

  • VIREO/ECNU @ TRECVID 2013: A Video Dance of Detection, Recounting and Search with Motion Relativity and Concept Learning from Wild
    C. W. Ngo, F. Wang, W. Zhang, C. C. Tan, Z. H. Sun, S. A. Zhu and T. Yao, NIST TRECVID Workshop (TRECVID'13), Gaithersburg, USA, 2013.

  • The Vireo Team at MediaEval 2013: Violent Scenes Detection by Mid-level Concepts Learnt from Youtube
    C. C. Tan and C. W. Ngo, MediaEval 2013 Workshop, Barcelona, Spain, Oct. 2013.

2012


  • Fast Semantic Diffusion for Large-Scale Context-Based Image and Video Annotation
    Y. G. Jiang, Q. Dai, J. Wang C. W. Ngo, X. Y. Xue and S. F. Chang, IEEE Trans. on Image Processing, vol. 21, no. 6, pp. 3080-3091, 2012.

  • Sampling and Ontologically Pooling Web Images for Visual Concept Learning
    S. A. Zhu, C. W. Ngo and Y. G. Jiang, IEEE Trans. on Multimedia, vol. 14, no. 4, pp. 1068-1078, 2012.

  • Summarizing Rushes Videos by Motion, Object and Event Understanding
    F. Wang and C. W. Ngo, IEEE Trans. on Multimedia, vol. 14, no. 1, pp. 76-87, 2012.

  • Boosting Web Video Categorization With Contextual Information from Social Web
    X. Wu, C. W. Ngo, Y. M. Zhu and Q. Peng, World Wide Web Journal, vol. 15, no. 2, pp. 197-212, 2012.

  • VIREO @ TRECVID 2012: Searching with Topology, Recounting will Small Concepts, Learning with Free Examples
    W. Zhang, C.-C. Tan, S. A. Zhu, T. Yao, L. Pang and C.-W. Ngo, NIST TRECVID Workshop (TRECVID'12), Gaithersburg, USA, November 2012.

  • Semantic Indexing and Multimedia Event Detection: ECNU at TRECVID 2012
    F. Wang, Z. Sun, D. Zhang and C.-W. Ngo, NIST TRECVID Workshop (TRECVID'12), Gaithersburg, USA, November 2012.

  • Snap-and-Ask: Answering Multimodal Question by Naming Visual Instance
    W. Zhang, L. Pang and C. W. Ngo, ACM Multimedia (ACM MM), Nara, Japan, October 2012. (Oral)

  • Trajectory-Based Modeling of Human Actions with Motion Reference Points
    Y. G. Jiang, Q. Dai, X. Y. Xue, W. Liu, C. W. Ngo, European Conference on Computer Vision (ECCV), Firenze, Italy, October 2012.

  • Predicting Domain Adaptivity: Redo or Recycle?
    T. Yao, C. W. Ngo and S. A. Zhu, ACM Multimedia (ACM MM), Nara, Japan, October 2012.

  • Community as a Connector: Associating Faces with Celebrity Names in Web Videos
    Z. N. Chen, C. W. Ngo, J. Cao and W. Zhang, ACM Multimedia (ACM MM), Nara, Japan, October 2012.

  • Video Hyperlinking: Libraries and Tools for Threading and Visualizing Large Video Collection
    L. Pang, W. Zhang and C. W. Ngo, ACM Multimedia (ACM MM), Nara, Japan, October 2012.

  • FashionAsk: Pushing Community Answers to Your Fingertips
    W. Zhang, L. Pang and C. W. Ngo, ACM Multimedia (ACM MM), Nara, Japan, October 2012. (Demo)

2011


  • Concept-Driven Multi-Modality Fusion for Video Search
    X. Y. Wei, Y. G. Jiang and C. W. Ngo, IEEE Trans. on Circuits and Systems for Video Technology, vol. 21, no. 1, pp. 62-73, 2011.

  • Tracking Web Video Topics: Discovery, Visualization and Monitoring
    J. Cao, C. W. Ngo, Y. D. Zhang and J. T. Li, IEEE Trans. on Circuits and Systems for Video Technology, vol. 21, no. 12, pp. 1835-1846, 2011.

  • Mining Event Structures from Web Videos
    X. Wu, Y. J. Lu, C. W. Ngo and Q. Peng, IEEE Multimedia, vol. 18, no. 1, pp. 38-51, 2011.

  • Beyond Search: Event Driven Summarization for Web Videos
    R. Hong, J. Tang, H. K. Tan, C. W. Ngo, S. Yan and T. S. Chua, ACM Trans. on Multimedia Computing, Communications, and Applications, vol. 7, no. 4, 2011.

  • VIREO @ TRECVID 2011: Instance Search, Semantic Indexing, Multimedia Event Detection and Known-Item Search
    C.-W. Ngo, S. A. Zhu, W. Zhang, C.-C. Tan, T. Yao, L. Pang and H.-K. Tan, NIST TRECVID Workshop (TRECVID'11), Gaithersburg, USA, December 2011.

  • Cross Media Hyperlinking for Search Topic Browsing
    S. Tan, C. W. Ngo, H. K. Tan and L. Pang, ACM Multimedia (ACM MM), Arizona, USA, November 2011. (Oral)

  • On the Pooling of Positive Examples with Ontology for Visual Concept Learning
    S. A. Zhu, C. W. Ngo and Y. G. Jiang, ACM Multimedia (ACM MM), Arizona, USA, November 2011.

  • Context-based Friend Suggestion in Online Photo Sharing Community
    T. Yao, C. W. Ngo and T. Mei, ACM Multimedia (ACM MM), Arizona, USA, November 2011.

  • Towards Textually Describing Complex Video Contents with Audio-Visual Concept Classifiers
    C. C. Tan, Y. G. Jiang and C. W. Ngo, ACM Multimedia (ACM MM), Arizona, USA, November 2011. (2nd Prize in Multimedia Grand Challenge) Presentation Demo1 Demo2

  • Galaxy Browser: Exploratory Search of Web Videos
    L. Pang, S. Tan, H. K. Tan and C. W. Ngo, ACM Multimedia (ACM MM), Arizona, USA, November 2011. (Demo)
    Demo is avaliable here: Galaxy Browser: Exploratory Search of Web Videos

  • Fusing Heterogeneous Modalities for Video and Image Re-ranking
    H. K. Tan and C. W. Ngo, ACM International Conference on Multimedia Retrieval (ICMR), Trento, Italy, April 2011.

2010


  • On the Annotation of Web Videos by Efficient Near-duplicate Search
    W. L. Zhao, X. Wu and C. W. Ngo, IEEE Trans. on Multimedia, vol. 12, no. 5, pp. 448-461, 2010.

  • Representations of Keypoint-Based Semantic Concept Detection: A Comprehensive Study
    Y. G. Jiang, J. Yang, C. W. Ngo, A. G. Hauptmann, IEEE Trans. on Multimedia, vol. 12, no. 1, pp. 42-53, 2010.

  • Efficient Mining of Multiple Partial Near-Duplicate Alignments by Temporal Network
    H. K. Tan, C. W. Ngo and T. S. Chua, IEEE Trans. on Circuits and Systems for Video Technology, vol. 20, no. 11, pp. 1486-1498, 2010.

  • Data-Driven Approaches Towards Community-Contributed Video Applications
    X. Wu, C. W. Ngo and W. L. Zhao, IEEE Multimedia, vol. 17, no. 4, pp. 58-69, 2010.

  • VIREO at TRECVID 2010: Semantic Indexing, Known-Item Search, and Content-Based Copy Detection
    C. W. Ngo, S. A. Zhu, H. K. Tan, W. L. Zhao, X. Y. Wei, NIST TRECVID Workshop (TRECVID'10), Gaithersburg, USA, Nov. 2010.

  • Topical Summarization of Web Videos by Visual-Text Time-Dependent Alignment
    S. Tan, H. K. Tan and C. W. Ngo, ACM Multimedia, Firenze, Italy, Oct. 2010.

  • Trajectory-based Visualization of Web Video Topics
    J. Cao, C. W. Ngo, Y. D. Zhang, D. M. Zhang, L. Ma, ACM Multimedia, Firenze, Italy, Oct. 2010.

  • On the Sampling of Web Images for Learning Visual Concept Classifiers
    S. A. Zhu, G. Wang, C. W. Ngo, Y. G. Jiang, International Conference on Image and Video Retrieval (CIVR), Xi'an, China, July 2010. (Oral)

  • Co-reranking by Mutual Reinforcement for Image Search
    T. Yao, T. Mei, C. W. Ngo, International Conference on Image and Video Retrieval (CIVR), Xi'an, China, July 2010. (Oral)

  • Semantic Context Modeling with Maximal Margin Conditional Random Fields for Automatic Image Annotation
    Y. Xiang, X. Zhou, Z. Liu, T. S. Chua, C. W. Ngo, Computer Vision and Pattern Recognition (CVPR), San Francisco, USA, June 2010.

  • PageSense: Style-wise Web Page Advertising
    L. Li, T. Mei, X. Niu, C. W. Ngo, International World Wide Web Conference (WWW), Raleigh, NC, USA, April 2010.

2009


  • Scale-Rotation Invariant Pattern Entropy for Keypoint-based Near-Duplicate Detection
    W. L. Zhao, C. W. Ngo, IEEE Trans. on Image Processing, vol. 18, issue 2, pp. 412-423, 2009.

  • Real-Time Near-Duplicate Elimination for Web Video Search with Content and Context
    X. Wu, C. W. Ngo, A. G. Hauptmann and H. K. Tan, IEEE Trans. on Multimedia, vol. 11, issue 2, pp. 196-207, 2009.

  • Visual Word Proximity and Linguistics for Semantic Video Indexing and Near-Duplicate Retrieval
    Y. G. Jiang, C. W. Ngo, Computer Vision and Image Understanding, vol. 113, issue 3, pp. 405-414, 2009.

  • Localized Matching Using Earth Mover's Distance Towards Discovery of Common Patterns from Small Image Samples
    H. K. Tan, C. W. Ngo, Image and Vision Computing, vol. 27, issue 10, pp. 1470-1483, 2009.

  • VIREO/DVMM at TRECVID 2009: High-Level Feature Extraction, Automatic Video Search, and Content-Based Copy Detection
    C. W. Ngo, Y. G. Jiang, X. Y. Wei, W. L. Zhao, Y. Liu, J. Wang, S. A. Zhu and S. F. Chang, NIST TRECVID Workshop (TRECVID'09), Gaithersburg, MD, USA, Nov. 16-17, 2009.

  • Semantic Context Transfer across Heterogeneous Sources for Domain Adaptive Video Search
    Y. G. Jiang, C. W. Ngo and S. F. Chang, ACM Multimedia (ACM MM), Beijing, China, Oct. 2009. (Oral)

  • Scalable Detection of Partial Near-Duplicate Videos by Visual-Temporal Consistency
    H. K. Tan, C. W. Ngo, R. Hong and T. S. Chua, ACM Multimedia (ACM MM), Beijing, China, Oct. 2009. (Oral)

  • Localizing Volumetric Motion for Action Recognition in Realistic Videos
    X. Wu, C. W. Ngo, J. Li and Y. Zhang, ACM Multimedia (ACM MM), Beijing, China, Oct. 2009.

  • Distribution-based Concept Selection for Concept-based Video Retrieval
    J. Cao, H. F. Jing, C. W. Ngo and Y. D. Zhang, ACM Multimedia (ACM MM), Beijing, China, Oct. 2009.

  • Towards Google Challenge: Combining Contextual and Social Information for Web Video Categorization
    X. Wu, W. L. Zhao, and C. W. Ngo, ACM Multimedia (ACM MM), Multimedia Grand Challenge, Beijing, China, Oct. 2009.

  • Event Driven Summarization for Web Videos
    R. Hong, J. Tang, H. K. Tan, S. Yan, C. W. Ngo and T. S. Chua, ACM Multimedia Workshop on Social Media (WSM), Beijing, China, Oct. 2009.

  • Domain Adaptive Semantic Diffusion for Large Scale Context-Based Video Annotation
    Y. G. Jiang, J. Wang, S. F. Chang and C. W. Ngo, Int. Conf. on Computer Vision (ICCV), Kyoto, Japan, Sept. 2009.

  • Exploring Inter-concept Relationship with Context Space for Semantic Video Indexing
    X. Y. Wei, Y. G. Jiang, and C. W. Ngo, International Conference on Image and Video Retrieval (CIVR), Santorini, GR, Jul. 2009. (Oral)

  • Large-scale Near-duplicate Web Video Search: Challenge and Opportunity
    W. L. Zhao, S. Tan , and C. W. Ngo, Int'l Conference on Multimedia & Expo (ICME), Workshop on Internet Multimedia Search and Mining, Cancun, Mexico, Jul. 2009. (Oral)

  • A Revisit of Generative Model for Automatic Image Annotation using Markov Random Fields
    Y. Xiang, X. Zhou, T. S. Chua and C. W. Ngo, Computer Vision and Pattern Recognition (CVPR), Miami, Florida, USA, Jun. 2009.

2008


  • Selection of Concept Detectors for Video Search by Ontology-Enriched Semantic Spaces
    X. Y. Wei, C. W. Ngo and Y. G. Jiang, IEEE Trans. on Multimedia, vol. 10, issue 6, pp. 1085-1096, 2008.

  • Structuring Low-quality Videotaped Lectures for Cross-Reference Browsing by Video Text Analysis
    F. Wang, C. W. Ngo, and T. C. Pong, Pattern Recognnition, vol. 41, no. 10, pp. 3257-3269, Oct 2008.

  • Simulating a Smartboard by Real-Time Gesture Detection in Lecture Videos
    F. Wang, C. W. Ngo, and T. C. Pong, IEEE Trans. on Multimedia, volume 10, no. 5, Aug. 2008.

  • Multi-Modal News Story Clustering with Pairwise Visual Near-Duplicate Constraint
    X. Wu, C. W. Ngo and A. G. Hauptmann, IEEE Trans. on Multimedia, volume 10, issue 2, pp. 188-199, February 2008.

  • Novelty and Redundancy Detection with Multimodalities in Cross-Lingual Broadcast Domain
    X. Wu, A. G. Hauptmann and C. W. Ngo, Computer Vision and Image Understanding, volume 110, issue 3, pp. 418-431, June 2008.

  • Beyond Semantic Search: What You Observe May Not Be What You Think
    C. W. Ngo, Y. G. Jiang, X. Y. Wei, W. L. Zhao, F. Wang, X. Wu, H. K. Tan, NIST TRECVID Workshop (TRECVID'08), Gaithersburg, MD, USA, Nov. 17-18, 2008.

  • Columbia University/VIREO-CityU/IRIT TRECVID2008 High-Level Feature Extraction and Interactive Video Search
    S. F. Chang, J. He, Y. G. Jiang, E. El Khoury, C. W. Ngo, A. Yanagawa, E. Zavesky, NIST TRECVID Workshop (TRECVID'08), Gaithersburg, MD, USA, Nov. 17-18, 2008.

  • Video Event Detection Using Motion Relativity and Visual Relatedness
    F. Wang, Y. G. Jiang, and C. W. Ngo, ACM Multimedia (ACM MM), Vancouver, Canada, Oct. 2008. (Oral)

  • Fusing Semantics, Observability, Reliability and Diversity of Concept Detectors for Video Search
    X. Y. Wei, C. W. Ngo, ACM Multimedia (ACM MM), Vancouver, Canada, Oct. 2008. (Oral)

  • Modeling Video Hyperlinks with Hypergraph for Web Video Reranking
    H. K. Tan, C. W. Ngo, and X. Wu, ACM Multimedia (ACM MM), Vancouver, Canada, Oct. 2008.

  • Accelerating Near-Duplicate Video Matching by Combining Visual Similarity and Alignment Distortion
    H. K. Tan, X. Wu, C. W. Ngo, and W. L. Zhao, ACM Multimedia (ACM MM), Vancouver, Canada, Oct. 2008.

  • CU-VIREO374: Fusing Columbia374 and VIREO374 for Large Scale Semantic Concept Detection
    Y. G. Jiang, A. Yanagawa, S. F. Chang, C. W. Ngo, Columbia University ADVENT Technical Report #223-2008-1, 2008.

  • Bag-of-Visual-Words Expansion Using Visual Relatedness for Video Indexing
    Y. G. Jiang, C. W. Ngo, ACM SIGIR 2008, Singapore, Jul. 2008.

  • Ontology-Based Visual Word Matching for Near-Duplicate Retrieval
    Y. G. Jiang, C. W. Ngo, Int'l Conference on Multimedia & Expo (ICME), Hannover, Germany, Jun. 2008. (Oral)

2007


  • Near-Duplicate Keyframe Identification with Interest Point Matching and Pattern Learning
    W. L. Zhao, C. W. Ngo, H. K. Tan, X. Wu, IEEE Trans. on Multimedia, vol. 9, pp. 1037-1048, Aug 2007.

  • Moving Object Detection, Association and Selection in Home Videos
    Z. Pan, C. W. Ngo, IEEE Trans. on Multimedia, vol. 9, no. 2, Feb 2007.
    [Sample Clips, Ground-truth and Results]

  • Lecture Video Enhancement and Editing by Integrating Posture, Gesture and Text
    F. Wang, C. W. Ngo, and T. C. Pong, IEEE Trans. on Multimedia, vol. 9, no. 2, Feb 2007.

  • OM-based Video Shot Retrieval by One-to-One Matching
    Y. X. Peng, C. W. Ngo, Multimedia Tools and Applications, vol. 34, issue 2, pp. 249-266, Aug 2007.

  • Practical Elimination of Near-Duplicates from Web Video Search
    X. Wu, A. G. Hauptmann and C. W. Ngo, ACM Multimedia (MM'07), Augsburg, Germany, Sep. 2007. (oral)

  • Novelty Detection for Cross-Lingual News Stories with Visual Duplicates and Speech Transcripts
    X. Wu, A. G. Hauptmann and C. W. Ngo, ACM Multimedia (MM'07), Augsburg, Germany, Sep. 2007. (oral)

  • Ontology-Enriched Semantic Space for Video Search
    X. Y. Wei, C. W. Ngo, ACM Multimedia (MM'07), Augsburg, Germany, Sep. 2007. (oral)

  • Rushes Video Summarization by Object and Event Understanding
    F. Wang, C. W. Ngo, TRECVID BBC Rushes Summarization Workshop at ACM Multimedia (MM'07), Augsburg, Germany, Sep. 2007.

  • Experimenting VIREO-374: Bag-of-Visual-Words and Visual-Based Ontology for Semantic Video Indexing and Search
    C. W. Ngo, Y. G. Jiang, X. Wei, F. Wang, W. Zhao, H.-K. Tan, X. Wu, NIST TRECVID Workshop (TRECVID'07), Gaithersburg, MD, USA, Nov. 5-6, 2007.

  • Evaluating Bag-of-Visual-Words Representations in Scene Classification
    J. Yang, Y. G. Jiang, A. G. Hauptmann, C. W. Ngo, ACM SIGMM Int'l Workshop on Multimedia Information Retrieval (MIR'07), Augsburg, Germany, Sep. 2007.

  • Near-Duplicate Keyframe Retrieval with Visual Keywords and Semantic Context
    X. Wu, W. L. Zhao, C. W. Ngo, International Conference on Image and Video Retrieval (CIVR), 2007.

  • Towards Optimal Bag-of-Features for Object Categorization and Semantic Video Retrieval
    Y. G. Jiang, C. W. Ngo, J. Yang, International Conference on Image and Video Retrieval (CIVR), 2007.

  • Efficient Near-Duplicate Keyframe Retrieval with Visual Language Models
    X. Wu, W. L. Zhao, C. W. Ngo, International Conference on Multimedia and Expo (ICME), 2007.

  • Mining Multiple Visual Appearance of Semantics for Image Annotation
    H. K. Tan & C. W. Ngo, International Conference on Multimedia Modeling (MMM), 2007.

2006


  • Clip-based Similarity Measure for Query-dependent Clip Retrieval and Video Summarization
    Y. X. Peng, C. W. Ngo, IEEE Trans. on Circuits and Systems for Video Technology, 16(5), pp. 612-627, 2006.

  • Threading and Autodocumenting in News Videos
    X. Wu, C. W. Ngo, Q. Li, IEEE Signal Processing Magazine, 23(2), pp. 59-68, March 2006.

  • Gestalt-based Feature Similarity Measure in Trademark Database
    H. Jiang, C. W. Ngo & H. K. Tan, Pattern Recognition, 39(5), pp. 988-1001, 2006.

  • Discovery of Near-Duplicate and Common Visual Concepts
    C. W. Ngo, Dagstuhl Seminar: Content-based Retrieval, Schloss Dagstuhl, Germany 2006. (seminar, invited)
    [http://kathrin.dagstuhl.de/06171/Materials2/]

  • Fast Tracking of Near-Duplicate Keyframes in Broadcast Domain with Transitivity Propagation
    C. W. Ngo, W. L. Zhao, Y. G. Jiang, ACM Multimedia Conference (MM'06), Oct 2006.
    [Dataset and Ground-truth]

  • Audio Similarity Measure by Graph Modeling and Matching
    Y. X. Peng, C. W. Ngo and C. Fang, ACM Multimedia Conference (MM'06), Oct 2006.

  • Modeling Local Interest Points for Semantic Detection and Video Search at TRECVID 2006
    Y. G. Jiang, X. Wei, C. W. Ngo, H. K. Tan, W. L. Zhao, X. Wu, TRECVID Workshop, 2006.

  • Hierarchical Hidden Markov Model for Rushes Structuring and Indexing
    C. W. Ngo, Z. Pan, X. Y. Wei, International Conference on Image and Video Retrieval (CIVR), 2006.

  • Keyframe Retrieval by Keypoints: Can Point-to-Point Matching Help?
    W. L. Zhao, Y. G. Jiang, C. W. Ngo, International Conference on Image and Video Retrieval (CIVR), 2006.

  • Exploring Semantic Concept Using Local Invariant Features
    Y. G. Jiang, W. L. Zhao, C. W. Ngo, Asia-Pacific Workshop on Visual Information Processing, 2006. (invited)

  • Prediction-based Gesture Detection in Lecture Videos by Combining Visual, Speech and Electronic Slides
    F. Wang, C. W. Ngo, T. C. Pong, International Conference on Multimedia and Expo (ICME), July 2006.

2005


  • Video Summarization and Scene Detection by Graph Modeling
    C. W. Ngo, Y. F. Ma & H. J. Zhang, IEEE Trans. on Circuits and Systems for Video Technology, 15(2), pp. 296-305, 2005.

  • Selective Object Stabilization for Home Video Consumers
    Z. Pan, C. W. Ngo, IEEE Trans. on Consumer Electronics, 51(4), pp. 1074-1084, Nov 2005.

  • Video Text Detection and Segmentation for Optical Character Recognition
    C. W. Ngo, C. K. Chan, Multimedia Systems, vol. 10, no. 3, pp. 261-272, 2005.

  • Exploiting Self-Adaptive Posture-based Focus Estimation for Lecture Video Editing
    F. Wang, C. W. Ngo & T. C. Pong, ACM Multimedia Conference (MM'05), Nov 2005.

  • Common Pattern Discovery using Earth Mover's Distance and Local Flow Maximization
    H. K. Tan & C. W. Ngo, Int. Conf. on Computer Vision (ICCV'05), Oct 2005.

  • Motion Driven Approaches to Shot Boundary Detection, Low-Level Feature Extraction and BBC Rushes Characterization at TRECVID 2005
    C. W. Ngo, Z. Pan, X. Wei, X. Wu, H. K. Tan & W. Zhao, TRECVID workshop, 2005.

  • Co-Clustering of Time-Evolving News Story with Transcript and Keyframe
    X. Wu, C. W. Ngo & Q. Li, Int. Conf. on Multimedia and Expo (ICME), July 2005.

  • EMD-based Video Clip Retrieval by Many-to-Many Matching
    Y. Peng, C. W. Ngo, Int. Conf on Image and Video Retrieval (CIVR), July 2005.

  • Hot Event Summarization by Graph Modeling and Matching
    Y. Peng, C. W. Ngo, Int. Conf on Image and Video Retrieval (CIVR), July 2005.

  • Shot-based Video Retrieval by Integrating Color and Motion Features
    Y. Peng, C. W. Ngo, Int. Conf. on Intelligent Multimedia Computing and Networking, July 2005.

  • A Semantic View Mechanism for User-centric Video Adaptation
    D. Ding, F. Wang, C. W. Ngo, Q. Li, Int. Conf. on Intelligent Multimedia Computing and Networking, July 2005.

2004


  • Indexing and Matching of Polyphonic Songs for Query-by-Singing System
    T. W. Leung & C. W. Ngo, ACM Multimedia Conference (MM'04), 2004.

  • Graph based Image Matching
    H. Jiang & C. W. Ngo, Int. Conference on Pattern Recognition (ICPR), 2004.

  • Novel Seed Selection for Multiple Object Detection and Tracking
    Z. Pan and C. W. Ngo, Int. Conf on Pattern Recognition (ICPR), 2004.

  • ICA-FX Features for Classification of Singing Voice and Instrumental Sound
    T. W. Leung, C. W. Ngo & W. H. Lau, Int. Conference on Pattern Recognition (ICPR), 2004.

  • Gesture Tracking and Recognition for Lecture Video Editing
    F. Wang, C. W. Ngo & T. C. Pong, Int. Conf. on Pattern Recognition (ICPR), 2004.

  • Deformable Geometry Model Matching by Topological and Geometric Signatures
    K. L. Tam, R. Lau & C. W. Ngo, Int. Conf. on Pattern Recognition (ICPR), 2004.

  • Deformable Geometry Model Matching Using Bipartite Graph
    K. L. Tam, R. Lau & C. W. Ngo, Proc. of Computer Graphics International (CGI), 2004.

  • A Robust Method for Recovering Geometric Proxy from Multiple Panoramic Images
    A. Wan, A. Siu, R. Lau & C. W. Ngo, Int. Conf. on Image Processing (ICIP), 2004.

  • Structuring Home Video by Snippet Detection and Pattern Parsing
    Z. Pan & C. W. Ngo, ACM SIGMM Int. Workshop on Multimedia Information Retrieval (MIR), 2004.

  • Clip-based Similarity Measure for Hierarchical Video Retrieval
    Y. Peng & C. W. Ngo, ACM SIGMM It. Workshop on Multimedia Information Retrieval (MIR), 2004.

2003


  • Motion Analysis and Segmentation through Temporal Slices Processing
    C. W. Ngo, T. C. Pong & H. J. Zhang, IEEE Trans. on Image Processing, vol. 12, no. 3, pp. 341-355, 2003.

  • Automatic Video Summarization by Graph Modeling
    C. W. Ngo, Y. F. Ma & H. J. Zhang, Int'l Conf on Computer Vision (ICCV'03), 2003.

  • A Robust Dissolve Detector by Support Vector Machine
    C. W. Ngo, ACM Multimedia Conference (MM'03), 2003.

  • Synchronization of Lecture Videos and Electronic Slides by Video Text Analysis
    F. Wang, C. W. Ngo & T. C. Pong, ACM Multimedia Conference (MM'03), 2003.

  • Image Mining using Inexact Maximal Common Subgraph of Multiple ARGs
    H. Jiang & C. W. Ngo, Int. Conf. on Visual Information Systems, 2003.

  • Structuring Lecture Video for Distance Learning Application
    C. W. Ngo, F. Wang & T. C. Pong, International Symposium on Multimedia Software Engineering, 2003.

  • Video Text Detection and Segmentation
    C. W. Ngo & C. K. Chan, Int. Conf. on Visual Information Systems, 2003.

  • Video Clip Retrieval by Maximal Matching and Optimal Matching in Graph Theory
    Y. Peng, C. W. Ngo & et. al., Int. Conf. Multimedia Expo (ICME), 2003.

  • Detection of Documentary Scene Changes by Audio-visual Fusion
    A. Velivell, C. W. Ngo & T. S. Huang, Int. Conf. on Image and Video Retrieval (CIVR), 2003.

2002


  • On Clustering and Retrieval of Video Shots through Temporal Slices Analysis
    C. W. Ngo, T. C. Pong & H. J. Zhang, IEEE Trans. on Multimedia, Vol. 4, No. 4, pp. 446-459, 2002.

  • Motion-based Video Representation for Scene Change Detection
    C. W. Ngo, T. C. Pong & H. J. Zhang, International Journal of Computer Vision, vol. 50, no. 2, pp. 127-143, 2002.

  • Detection of Slide Transition for Topic Indexing
    C. W. Ngo, T. C. Pong & T. S. Huang, Int. Conf. on Multimedia Expo (ICME), 2002.

  • Motion Retrieval by Temporal Slice Analysis
    C. W. Ngo, T. C. Pong & H. J. Zhang, Int. Conf. on Pattern Recognition (ICPR), 2002.

2001


  • Video Partitioning through Temporal Slices Analysis
    C. W. Ngo, T. C. Pong & R. T. Chin, IEEE Trans. on Circuits and Systems for Video Technology, 11(8), pp. 941-953, 2001.

  • Exploiting Image Indexing Techniques in DCT Domain
    C. W. Ngo, T. C. Pong & R. T. Chin, Pattern Recognition, vol. 34, no.9, pp. 1841-1851, 2001.

  • Recent Advances in Content Based Video Analysis
    C. W. Ngo, H. J. Zhang & T. C. Pong, International Journal of Image and Graphic, Vol. 1, No. 3, pp. 445-469, 2001.

  • On Clustering and Retrieval of Video Shots
    C. W. Ngo, T. C. Pong & H. J. Zhang, ACM Multimedia Conference (MM'01), 2001.

  • Integrating Color and Spatial Features for Content-based Video Retrieval
    T. Lin, C. W. Ngo, H. J. Zhang & Q. Y. Shi, Int. Conf. on Image Processing (ICIP), 2001.

2000


  • Motion Characterization by Temporal Slice Analysis
    C. W. Ngo, T. C. Pong, H. J. Zhang & R. T. Chin, Computer Vision and Pattern Recognition (CVPR'00), 2000

  • Motion-based Video Representation for Scene Change Detection
    C. W. Ngo, T. C. Pong, H. J. Zhang & R. T. Chin, Int. Conf. Pattern Recognition (ICPR), 2000.

  • A Robust Wipe Detection Algorithm
    C. W. Ngo, T. C. Pong & R. T. Chin, Asian Conference on Computer Vision (ACCV), 2000.

1999


  • Motion Tracking of Human Mouth by Generalized Deformable Models
    S. Chan, C. W. Ngo & Kok F. Lai, Pattern Recognition Letters, vol. 20, pp. 879-887, 1999.

  • Detection of Gradual Transitions through Temporal Slice Analysis
    C. W. Ngo, T. C. Pong & R. T. Chin, Computer Vision and Pattern Recognition (CVPR'99), 1999.

  • Camera Breaks Detection by Partitioning of 2D Spatial-temporal Images in MPEG Domain
    C. W. Ngo, T. C. Pong & R. T. Chin, IEEE Multimedia Systems (ICMCS), Italy, 1999.

1998 and earlier


  • A Survey of Video Parsing and Image Indexing Techniques in Compressed Domain
    C. W. Ngo, T. C. Pong & R. T. Chin, Symposium on Image, Speech, Signal Processing, and Robotics (Workshopon Computer Vision), Hong Kong, 1998.

  • Exploiting Image Indexing Techniques in DCT Domain
    C. W. Ngo, T. C. Pong & R. T. Chin, IAPR International Workshop on Multimedia Information Analysis & Retrieval, Hong Kong, 1998

  • Experiments on Routing, Filtering and Chinese Text Retrieval in TREC-5
    C. W. Ngo & Kok F. Lai, The Fifth Text Retrieval Conference (TREC-5), Gaithersburg, 1997.

  • Tracking of Deformable Contours by Synthesis and Match
    Kok F. Lai, C. W. Ngo & S. Chan, Proc. Int. Conf. Pattern Recognition, Vol. 1, Vienna, Austria, Aug 1996.

  • Motion Tracking and Analysis of Deformable Objects by Generalized Active Contour Model
    C. W. Ngo, S. Chan & Kok F. Lai, Second Asian Conference on Computer Vision, Singapore, 1995.

  • Application of Generalized Active Contour Model for Model-Based Image Coding
    C. W. Ngo, S. Chan & Kok F. Lai, International Conference on Multimedia Modeling, Singapore, 1995.

  • A High Speed Distributed File System for Multimedia Communications
    C. L. Chee, S. S. Erdogan, C. W. Ngo, C. K. Wong, Proceedings of IEEE Region 10's Ninth Annual International Conference, vol. 1, Singapore, 1994

Copyright Notice: The papers presented here are to ensure timely dissemination of scholarly and technical work and only for personal or classroom use. Copyright and all rights therein are retained by authors and/or by other copyright holders.