Research

Future Students

Data Science Lab is hiring again!

We are looking for two talented graduate students in the broad areas of Big Data Analytics / Machine Learning / Data Science. Send an e-mail if you are interested to apply.

Research Interests:

  • Big Data, Data Analytics, Machine Learning
  • Data Curation, Data Discovery, Data Cleaning
  • Data Integration, Heterogeneous Datasets
  • Business Intelligence, Query and Systems Tuning
  • Web Search, Graph Search, Web Semantic

Publications:

2021

  • NEW: B. Askari, J. Szlichta, A. Salehi-Abari. Variational Autoencoders for Top-K Recommendation with Implicit Feedback. ACM SIGIR, 4 pages, 2021.
  • NEW: R. Hamidi, E. Bagheri, M. Kargar, D. Srivastava, J. Szlichta. Retrieving Skill-Based Teams from Collaboration Network. ACM SIGIR, 4 pages, 2021.
  • M. Kargar, L. Golab, D. Srivastava, J. Szlichta, M. Zihayat. Effective Keyword Search over Weighted Graph Social Networks. IEEE TKDE, 16 pages, 2021.
  • R. Karegar, L. Godfrey, L. Golab, M. Kargar, J. Szlichta, D. Srivastava. Efficient Discovery of Approximate Order Dependencies. EDBT, 27-432, 2021.
  • M. Kargar, L. Golab, D. Srivastava, J. Szlichta, M. Zihayat. Effective Keyword Search in Weighted Graphs (Extended Abstract). IEEE ICDE, 2 pages, 2021.
  • S. Bryson, C. Henderson, V. Corvinelli, P. Godfrey, P. Mierzejewski, J. Szlichta, C. Zuzarte. Database Management Systems Tuning through AI. Canadian AI, industry track, 4 pages, 2021.
  • M. Kargar, J. Szlichta, M. Zihayat. Environmentally Friendly Tour Recommendations using Sustainable Transporters. Canadian Operational Research Society (CORS), abstract submission, 2021.

2020

  • S. Bryson, H. Davoudi, L. Golab, M. Kargar, Y. Lytvyn, P. Mierzejewski, J. Szlichta, M. Zihayat. Robust Keyword Search in Large Attributed Graphs. Information Retrieval Journal, Springer, 23(5): 502-524 (2020).
  • A. Khan, L. Golab, M. Kargar, J. Szlichta, M. Zihayat. Compact Group Discovery in Attributed Graphs and Social Networks. Information Processing and Management, Elsevier, 1-18, 2020.
  • P. Li, J. Szlichta, M. Böhlen and D. Srivastava. Discovering Band Order Dependencies. IEEE ICDE, 1878-1881.
  • Radin Hamidi, H. Fani, M. Kargar, J. Szlichta, Ebrahim Bagheri. Learning to Form Skill-based Teams of Experts. ACM CIKM, 2049-2052, 2020.
  • J. Szlichta, P. Godfrey, L. Golab, M. Kargar, D. Srivastava. Erratum for Discovering Order Dependencies through Order Compatibility. EDBT, 659-663, 2020.

2019

  • G. Damasio, V. Corvinelli, P. Godfrey, P. Mierzejewski, A. Mihaylov,  J. Szlichta, C. Zuzarte. Guided Automated Learning for query workload re-Optimization. PVLDB 12(12): 2010-2021, 2019.
  •  G. Damasio, S. Bryson, V. Corvinelli, P. Godfrey, P. Mierzejewski, J. Szlichta, C. Zuzarte. GALO: Guided Automated Learning for re-Optimization. PVLDB, 12(12): 1778-1781, 2019.
  • M. Zihayat, M. Kargar, J. Szlichta. A Survey of High Utility Pattern Mining Algorithms for Big Data. High-Utility Pattern Mining: Theory, Algorithms and Applications, Springer, 75-96, 2019.
  • H. Davoudi, P. Godfrey, L. Golab, M. Kargar and D. Srivastava,  J. Szlichta . Bringing Order to Data. AMW, 5.1-5, 2019.
  • M. Kargar, M. Zihayat, J. Szlichta. Mining and Exploration of Attributed Graphs: Theory and Applications. ACM CASCON x EVOKE (organized by IBM), 2 pages, 2019.

2018

  • J. Szlichta, P. Godfrey, L. Golab, M. Kargar and D. Srivastava. Effective and Complete Discovery of Bidirectional Order Dependencies via Set-based Axioms. VLDB Journal 27(4): 573-591, 2018.
  • A. Mihaylov, P. Godfrey, L. Golab, M. Kargar, D. Srivastava and J. Szlichta. FastOD: Bringing Order to Data. IEEE ICDE, 1561-1564, 2018.
  • Z. Zheng, M. Alipour Langouri, Z. Qu, I. Currie, F. Chiang, L. Golab and J. Szlichta. FastOFD: Contextual Data Cleaning with Ontology Functional Dependencies. EDBT, 694-697, 2018.
  • M. Alipour-Langouri, Z. Zheng, F. Chiang, L. Golab and J. Szlichta. Contextual Data Cleaning. IEEE ICDE workshop on Context in Analytics, 21-24, 2018.
  • M. Zihayat, A. An, L. Golab, M. Kargar and J. Szlichta. Effective Team Formation in Expert Networks. AMW, 4.1-4, 2018.
  • J. Szlichta: Order Dependencies. Encyclopedia of Database Systems, Springer, Editors: L. Liu (Georgia Tech) and T. Özsu (University of Waterloo), 2 pages, 2018.

2017

  • J.Szlichta, P. Godfrey, L. Golab, M. Kargar and D. Srivastava: Effective and Complete Discovery of Order Dependencies via Set-based Axiomatization. PVLDB 10(7): 721-732, 2017.
  • S. Baskaran, A. Keller, F. Chiang, L. Golab and J. Szlichta, Efficient Discovery of Ontology Functional Dependencies, ACM CIKM 2017, 1847-1856.
  • M. Zihayat, A. An, L. Golab, M. Kargar and J. Szlichta: Authority-Based Team Discovery in Social Networks. EDBT, 498-501, 2017.
  • M. Kargar, A. An, P. Godfrey, J. Szlichta and X. YuL Meaningful Keyword Search over Databases with Complex Schema. AMW, 4 pages, 2017.

2016

  • G. Damasio, P. Mierzejewski, J. Szlichta and C. Zuzarte: Query Performance Problem Determination with Knowledge Base in Semantic Web System OptImatch. EDBT 2016, 515-526.
  • M. Kargar, L. Golab and J. Szlichta: eGraphSearch: Effective Keyword Search in Graphs. ACM CIKM 2016, 2461-2464.
  • G. Damasio, P. Mierzejewski, J. Szlichta and C. Zuzarte: OptImatch: Semantic Web System for Query Problem Determination. IEEE ICDE 2016, 1334-1337.
  • M. Ferron, K. Pu and J. Szlichta: ARC: A Pipeline Approach Enabling Large-Scale Graph Visualization. ACM/IEEE ASONAM 2016, 1397-1400.
  • A. Keller, J. Szlichta: Ontology Functional Dependencies. AMW 2016, 4 pages.

2015

  • N. Prokoshyna, J. Szlichta, F. Chiang, R. J. Miller and D. Srivastava: Combining Quantitative and Logical Data Cleaning. PVLDB 9(4): 300-311, 2015.
  • M. Kargar, A. An, N. Cercone, P. Godfrey, J. Szlichta and  X. Yu: Meaningful Keyword Search in Relational Databases with Large and Complex Schema. IEEE ICDE 2015, 411-422.
  • J. Szlichta, Lukasz Golab, D. Srivastava:
    On Axiomatization and Inference Complexity over a Hierarchy of Functional Dependencies. AMW 2015, 12 pages.

2014

  • M. Volkovs, F. Chiang, J. Szlichta, R. Miller: Continuous data cleaning. IEEE ICDE 2014: 244-255.
  • J. Szlichta, P. Godfrey, J. Gryz, W. Ma, W. Qiu, Calisto Zuzarte: Business-Intelligence Queries with Order Dependencies in DB2. EDBT 2014: 750-761.
  • M. Kargar, A. An, N. Cercone, P. Godfrey, J. Szlichta, X. Yu: MeanKS: meaningful keyword search in relational databases with complex schema. ACM SIGMOD Conference 2014: 905-908.

2013

  • J. Szlichta, P. Godfrey, J. Gryz, C. Zuzarte: Expressiveness and Complexity of Order Dependencies. PVLDB 6(14): 1858-1869, 2013.
  • J. Szlichta, P. Godfrey, J. Gryz, C. Zuzarte: Axiomatic System for Order Dependencies. AMW, 4 pages, 2013.

2012

  • J. Szlichta, P. Godfrey, J. Gryz: Fundamentals of Order Dependencies. PVLDB 5(11): 1220-1231, 2012.
  • J. Szlichta, P. Godfrey, J. Gryz: Chasing Polarized Order Dependencies. AMW 2012: 168-179.

2011

  • J. Szlichta, P. Godfrey, J. Gryz, W. Ma, P. Pawluk, C. Zuzarte: Queries on dates: fast yet not blind. EDBT 2011: 497-502.

For a list of dblp publications, please visit link

Current and Past Collaborators:

  • Divesh Srivastava (AT&T Lab-Research)
  • Lukasz Golab, Ihab Ilyas (University of Waterloo)
  • Fei Chiang (McMasters)
  • Michael Böhlen (University of Zurich)
  • Renée Miller (University of Toronto)
  • Aijun An, Parke Godfrey and Jarek Gryz (York University)
  • Vincent Corvinelli, Wenbin Ma, Piotr Mierzejewski, Calisto Zuzarte (IBM Lab)
  • Mehdi Kargar, Morteza Zihayat (Ryerson University)
  • Ken Pu, Amirali Abari (Ontario Tech University)