Grants and Contributions:

Title:
Cross-Domain Data Analytics
Agreement Number:
RGPIN
Agreement Value:
$20,000.00
Agreement Date:
May 10, 2017 -
Organization:
Natural Sciences and Engineering Research Council of Canada
Location:
Ontario, CA
Reference Number:
GC-2017-Q1-01899
Agreement Type:
Grant
Report Type:
Grants and Contributions
Additional Information:

Grant or Award spanning more than one fiscal year. (2017-2018 to 2018-2019)

Recipient's Legal Name:
Capretz, Miriam (The University of Western Ontario)
Program:
Discovery Grants Program - Individual
Program Purpose:

The amount of data generated and recorded every day is growing at an unprecedented rate as a result of developments in Web technologies, social media, and mobile and sensor devices. This growth has led to massive data sets, which if properly analyzed, can provide valuable insights and knowledge.

Data analytics aim to uncover useful information from data using techniques from statistics, machine learning, data mining, and other areas. Due to the size and diversity of the collected data sets, conducting data analytics today is challenging. First, it is typically insufficient to analyze data from a single domain; single domain data must be supplemented with data from other sources. For instance, customers' historical purchase records can be enriched with weather and social media information to create a richer analysis of shopping habits. Moreover, the sheer size of data poses challenges to existing analytics algorithms, which must be adapted or even redesigned to take modern run-time environments into consideration. Finally, visualizing large data sets and the progress of analytics processes is essential to users, but is not entirely supported by current tools.

This research program aims to advance the field of cross-domain data analytics by devising a comprehensive architecture that can handle large heterogeneous data sets and provides users with rich and interactive visualizations of the analysis process. Hence, four main directions will be the focus of this research program. First, it will investigate cross-domain fusion techniques with the aim of integrating data and analytics processes from different domains and improving the quality and diversity of insights obtained. Second, this program will research transfer, ensemble, and reinforcement learning paradigms to enable them to work with large cross-domain data sets. Third, it will focus on creating a unified abstraction and accompanying run-time platfo rm that seamlessly fuses different learning processes and operates on large-scale data. Finally, this program will develop novel visualization techniques to present the knowledge induced during learning as well as the learning process itself.

The proposed research program is expected not only to push the boundaries of data analytics research but also to assist organizations and individuals that are interested in obtaining insights from large-scale cross-domain data. By broadening the impact of data analytics, this program will assist Canadian companies to create a competitive advantage and will place Canada in a leadership position in this emerging field.