NExT is an acronym for “NUS-Extreme-Tsinghua”

It is a center jointly setup between National University of Singapore (NUS) and Tsinghua University with the support of Media Development Authority (MDA) of Singapore. The term extreme search refers to the ability to search for live and dynamic data beyond what is indexed in the Web.

The center focuses on crawling and mining user-generated contents (UGC’s) in Singapore and Beijing in the following areas:

  1. location-oriented information: includes shared photos and check-in venues;
  2. topic-oriented information: covers forums, question-answering, tweets;
  3. application-oriented information: covers mobile applications and associated information and discussions; and
  4. structured information: includes factual, cultural and historical information.

These are publicly available data sources documenting the social interactions of people within a city. It will cover social activities of people and their collective preferences and interests. This gives rise to a graph of social pulses, and the research is focusing on transforming the unstructured, multisource and multimodal UGC contents into the graph depicting the social pulses of a city. The Center will work closely with other MDA funded centers to integrate their resources such as the sensor data streams (from CSI), language translation and resources (from CSIDM), as well as various MDA start-up companies and government agencies to aggregate related structured contents. The Center will offer the organized information base, as well as related information processing tools, available to other research entities, academic institutes and start-up companies to develop innovative applications to support the users.

Research Focuses

Many research efforts have been done on the analysis, indexing, fusion and retrieval of Web, deep Web and multimedia contents. This centre focuses instead on the novel and more challenging topic of handling live information sources. To make these live and dynamic data available for general access, we need to tackle the following research issues.

  1. Develop distributed and database infrastructures to capture, analyze, and index the huge amount of continuous live data streams and their key analytics.
  2. Bridge the semantic gaps of understanding the contents of non-textual data arising from video, audio and other type of sensors.
  3. Build tools to analyze the contents of media sensors, the multilingual and often ungrammatical Blog and forum postings, and information generated from vast number of mobile devices.
  4. Fuse these live information sources to extract the latest events and understand the current hot topics of discussion and their sentiment, as well as the relationships among users and between users and resources.
  5. Build mobile platform to realize a conducive, informed environment to help users lead better life. Initial platforms are planned for campus and health care domain.

Vision and Aim

The vision of the center is to find and extract meaning from millions of real-time and mobile data streams. By tackling the problems at life scale with millions of data streams and in large geographical coverage (between Singapore and Beijing), we aim to tackle difficult real-life problem and achieve world prominence in live mobile media search.

The Evolving Web

Internet has revolutionalized the way we create, disseminate and consume information. The users have changed from passive recipients of information to active consumers of information. The nature of information is also changed from static and purely text, to dynamic, mobile and multimedia.

Eric Schmidt, the CEO of Google, has estimated the size of indexed Web to be about 5 million Terabytes (2010). This estimate does not include the size of deep (or un-indexed) Web, which is about 500 times larger. Like deep Web, the amount of information available from the live information sources (from live sensors, dynamic online forums, blogs and mobile devices) is huge and unknown. A combination of these information sources provides us with answers to possibly all questions and deep knowledge on users and their relationships with other users and resources.

© Copyright 2011 National University of Singapore. Designed by Jovian Lin. All Rights Reserved.