Big Data Analysis Group
The Big Data Analysis Group is a core research group in the State Key Laboratory of Software Development Environment, School of Computer Science and Engineering, Beihang University. Our research topics include spatiotemporal big data analysis and processing, crowd intelligence, crowdsourcing, federated learning, and privacy-preserving data analysis.
We design and develop big data algorithms and systems with both theoretical guarantees and practical usage.
  • [2019/12] Our MOOC Course "Algorithm Design and Analysis" was online!
  • [2019/12] Congratulations to Hao Cheng and Libin Wang on finishing their Masters!
  • [2019/12] Our paper "Federated LDA" was accepted by AAAI 2020.
  • [2019/11] Our paper on spatial crowdsourcing differential privacy was accepted ICDE 2020.
  • [2019/09] We successfully organized the CCF Advanced Disciplines Lectures (CCF ADL) "Crowd Intelligent Computing".
  • More...

Faculty

yxtong(AT)buaa.edu.cn
kexu(AT)nlsde.buaa.edu.cn
Yi Xu
xuy(AT)buaa.edu.cn

Ph.D Students

Dawei Gao
Ph.D. 2016-present
Tianshu Song
Ph.D. 2016-present
Qian Tao
Ph.D. 2016-present
Ph.D. 2017-present
Yansheng Wang
Ph.D. 2020-present
Kaining Zhang
Ph.D. 2020-present

Master Students

Wenqiang Li
Master. 2018-present
Yuguang Song
Master. 2018-present
Master. 2018-present
Master. 2019-present
Wangdong Liao
Master. 2019-present
Bingchen Song
Master. 2019-present
Yidi Wei
Master. 2019-present
Lipeng Zhang
Master. 2019-present
Wenbin Zhang
Master. 2019-present
Qiaoyang Liu
Master
2020-present
Xuchen Pan
Master
2020-present
Dingyuan Shi
Master
2020-present
Shuyue Wei
Master
2020-present
Ruisheng Zhang
Master
2020-present

Undergraduate Interns

Undergraduate
2017-present
Chunbo Xue
Undergraduate
2017-present
Wenhao Zhang
Undergraduate
2017-present
Boming Zhao
Undergraduate
2017-present

Alumni

Qinyi Wang
Master. 2014-2017
Now: Scientific Research Management, NWPU
Yuxiang Zeng
Master. 2014-2017
Now: Ph.D., HKUST
Liefeng Rong
Master. 2015-2018
Now: Aliyun
Hao Cheng
Master. 2017-2020
Now: Ph.D., UMD
Libin Wang
Master. 2017-2020
Now: Ph.D., HKUST

2020

  1. [VLDBJ 2020] Yongxin Tong, Zimu Zhou, Yuxiang Zeng, Lei Chen, Cyrus Shahabi. "Spatial Crowdsourcing: A Survey", The VLDB Journal, 29(1): 217–250, January 2020. PDF
  2. [TKDE 2020] Yongxin Tong, Yuxiang Zeng, Bolin Ding, Libin Wang, Lei Chen. "Two-Sided Online Micro-Task Assignment in Spatial Crowdsourcing", to appear in IEEE Transactions on Knowledge and Data Engineering, 2020. PDF Code&Data
  3. [TKDE 2020] Chen Jason Zhang, Lei Chen, H. V. Jagadish, Mengchen Zhang, Yongxin Tong. "Reducing Uncertainty of Schema Matching via Crowdsourcing with Accuracy Rates", IEEE Transactions on Knowledge and Data Engineering, 32(1): 135-151, 2020. PDF
  4. [VLDB 2020] Yuxiang Zeng, Yongxin Tong, Lei Chen. "Last-Mile Delivery Made Practical: An Efficient Route Planning Framework with Theoretical Guarantees", Proceedings of the 46th International Conference on Very Large Databases, Tokyo, Japan, August 31-September 4, 2020. PDF Code&Data
  5. [SIGKDD 2020] Dawei Gao, Xiaoxi He, Zimu Zhou, Yongxin Tong, Ke Xu, Lothar Thiele. "Rethinking Pruning for Accelerating Deep Inference At the Edge", to appear in Proceedings of the 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, San Diego, California, USA, August 23-27, 2020. PDF
  6. [ICDE 2020] Qian Tao, Yongxin Tong, Zimu Zhou, Yexuan Shi, Lei Chen, Ke Xu. "Differentially Private Online Task Assignment in Spatial Crowdsourcing: A Tree-based Approach", in Proceedings of the 36th International Conference on Data Engineering, Dallas, Texas, USA, April 20-24, 2020. PDF Slides
  7. [AAAI 2020] Yansheng Wang, Yongxin Tong, Dingyuan Shi. "Federated Latent Dirichlet Allocation: A Local Differential Privacy Based Framework", in Proceedings of the 34th AAAI Conference on Artificial Intelligence, New York, USA, January 7-12, 2020. PDF Poster
  8. [GEIN 2020] Tianshu Song, Ke Xu, Jiangneng Li, Yiming Li, Yongxin Tong. "Multi-skill Aware Task Assignment in Real-time Spatial Crowdsourcing", GeoInformatica, 24(1): 153–173, January 2020. PDF
  9. [GEIN 2020] Yiming Li, Jingzhi Fang, Yuxiang Zeng, Balz Maag, Yongxin Tong, Lingyu Zhang. "Two-sided Online Bipartite Matching in Spatial Data: Experiments and Analysis", GeoInformatica, 24(1): 175–198, January 2020. PDF Code&Data

2019

  1. [TIST 2019] Qiang Yang, Yang Liu, Tianjian Chen, Yongxin Tong. "Federated Machine Learning: Concept and Applications", ACM Transactions on Intelligent Systems and Technology, 10(2): 12:1-12:19, February 2019. PDF
  2. [BigData 2019] Tianshu Song, Yongxin Tong, Shuyue Wei. "Profit Allocation for Federated Learning", in Proceedings of 2019 IEEE International Conference on Big Data, Los Angeles, CA, USA, pp. 2577-2586, December 9-12 2019. PDF Slides Code&Data Video
  3. [CIKM 2019] Di Jiang, Yuanfeng Song, Yongxin Tong, Xueyang Wu, Weiwei Zhao, Qian Xu, Qiang Yang. "Federated Topic Modeling", in Proceedings of the 28th ACM International Conference on Information and Knowledge Management, Beijing, China, November 3-7, 2019. PDF
  4. [CIKM 2019] Lingyu Zhang, Tianshu Song, Yongxin Tong, Zimu Zhou, Dan Li, Wei Ai, Lulu Zhang, Guobin Wu, Yan Liu, Jieping Ye. "Recommendation-based Team Formation for On-demand Taxi-calling Platforms", in Proceedings of the 28th ACM International Conference on Information and Knowledge Management, Beijing, China, November 3-7, 2019. PDF
  5. [SIGKDD 2019] Hao Liu, Yongxin Tong, Panpan Zhang, Xinjiang Lu, Jianguo Duan, Hui Xiong. "Hydra: A Personalized and Context-Aware Multi-Modal Transportation Recommendation System", in Proceedings of the 25th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Pages 2314-2324, Anchorage, Alaska, USA, August 4-8, 2019. PDF
  6. [ICDE 2019] Yi Xu, Yongxin Tong, Yexuan Shi, Qian Tao, Ke Xu, Wei Li. "An Efficient Insertion Operator in Dynamic Ridesharing Services", in Proceedings of the 35th International Conference on Data Engineering, Pages 1022-1033, Macau, China, April 8-12, 2019. PDF Slides
  7. [ICDE 2019] Yansheng Wang, Yongxin Tong, Cheng Long, Pan Xu, Ke Xu, Weifeng Lv. "Adaptive Dynamic Bipartite Graph Matching: A Reinforcement Learning Approach", in Proceedings of the 35th International Conference on Data Engineering, Pages 1478-1489, Macau, China, April 8-12, 2019. PDF Slides
  8. [ICDE 2019] Libin Wang, Yongxin Tong, Chunming Hu, Lei Chen, Yiming Li. "Procrastination-aware Scheduling: A Bipartite Graph Perspective" (Poster Paper), in Proceedings of the 35th International Conference on Data Engineering, Pages 1650-1653, Macau, China, April 8-12, 2019. PDF
  9. [ICDE 2019] Yongxin Tong, Lei Chen, Zimu Zhou, H. V. Jagadish, Lidan Shou, Weifeng Lv. "SLADE: A Smart Large-Scale Task Decomposer in Crowdsourcing" (Extended Abstract), in Proceedings of the 35th International Conference on Data Engineering, Pages 2133-2134, Macau, China, April 8-12, 2019. PDF
  10. [AAAI 2019] Boming Zhao, Pan Xu, Yexuan Shi, Yongxin Tong, Zimu Zhou, Yuxiang Zeng. "Preference-Aware Task Assignment in On-demand Taxi Dispatching: An Online Stable Matching Approach", in Proceedings of the 33rd AAAI Conference on Artificial Intelligence, Pages 2245-2252, Honolulu, Hawaii, USA, January 27-February 1, 2019. PDF Code&Data
  11. [AAAI 2019] Pan Xu, Yexuan Shi, Hao Cheng, John Dickerson, Karthik Abinav Sankararaman, Aravind Srinivasan, Yongxin Tong, Leonidas Tsepenekas. "A Unified Approach to Online Matching with Conflict-Aware Constraints", in Proceedings of the 33rd AAAI Conference on Artificial Intelligence, Pages 2221-2228, Honolulu, Hawaii, USA, January 27-February 1, 2019. PDF

2018

  1. [TKDE 2018] Yongxin Tong, Lei Chen, Zimu Zhou, H. V. Jagadish, Lidan Shou, Weifeng Lv. "SLADE: A Smart Large-Scale Task Decomposer in Crowdsourcing", IEEE Transactions on Knowledge and Data Engineering, 30(8): 1588-1601, August 2018. PDF
  2. [SIGSPATIAL Special 2018] Yongxin Tong, Zimu Zhou. "Dynamic Task Assignment in Spatial Crowdsourcing", SIGSPATIAL Special, 10(2): 18-25, July 2018. PDF
  3. [VLDB 2018] Yongxin Tong, Yuxiang Zeng, Zimu Zhou, Lei Chen, Jieping Ye, Ke Xu. "A Unified Approach to Route Planning for Shared Mobility", Proceedings of the VLDB Endowment, 11(11): 1633-1646, Rio de Janeiro, Brazil, August 27-31, 2018. PDF Slides Code&Data
  4. [SIGKDD 2018] Bowen Du, Yongxin Tong, Zimu Zhou, Qian Tao, Wenjun Zhou. "Demand-Aware Charger Planning for Electric Vehicle Sharing", in Proceedings of the 24th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Pages 1330-1338, London, United Kingdom, August 19-23, 2018. PDF
  5. [SIGMOD 2018] Yongxin Tong, Libin Wang, Zimu Zhou, Lei Chen, Bowen Du, Jieping Ye. "Dynamic Pricing in Spatial Crowdsourcing: A Matching-Based Approach", in Proceedings of the 37th ACM SIGMOD International Conference on Management of Data, Pages 773-788, Houston, TX, USA, June 10-15, 2018. PDF Slides
  6. [DASFAA 2018] Qian Tao, Yuxiang Zeng, Zimu Zhou, Yongxin Tong, Lei Chen, Ke Xu. "Multi-worker-aware Task Planning in Real-time Spatial Crowdsourcing", in Proceedings of the 23rd International Conference on Database Systems for Advanced Applications, Pages 301-317, Gold Coast, Australia, May 21-24, 2018. PDF Code&Data
  7. [ICDE 2018] Yuxiang Zeng, Yongxin Tong, Lei Chen, Zimu Zhou. "Latency-oriented Task Completion via Spatial Crowdsourcing", in Proceedings of the 34th International Conference on Data Engineering, Pages 317-328, Paris, France, April 16-20, 2018. PDF Slides Code&Data
  8. [ICDE 2018] Zheng Liu, Lei Chen, Yongxin Tong. "Realtime Traffic Speed Estimation with Sparse Crowdsourced Data", in Proceedings of the 34th International Conference on Data Engineering, Pages 329-340, Paris, France, April 16-20, 2018. PDF

2017

  1. [TKDE 2017] Rui Meng, Lei Chen, Yongxin Tong, Chen Jason Zhang. "Knowledge Base Semantic Integration Using Crowdsourcing", IEEE Transactions on Knowledge and Data Engineering, 29(5): 1087-1100, May 2017. PDF
  2. [DSE 2017] Dawei Gao, Yongxin Tong, Jieying She, Tianshu Song, Lei Chen, Ke Xu. "Top-k Team Recommendation and Its Variants in Spatial Crowdsourcing", Data Science and Engineering, 2(2): 136-150, June 2017. PDF
  3. [NeuroComp 2017] Yong Chen, Hui Zhang, Yongxin Tong, Ming Lu. "Diversity Regularized Latent Semantic Match for Hashing", Neurocomputing, 230: 77–87, March 2017. PDF
  4. [VLDB 2017] Yongxin Tong, Lei Chen, Cyrus Shahabi. "Spatial Crowdsourcing: Challenges, Techniques, and Applications", Proceedings of the VLDB Endowment, 10(12): 1988-1991, Munich, Germany, August 28 - September 1, 2017. (Tutorial) PDF Slides
  5. [VLDB 2017] Yongxin Tong, Libin Wang, Zimu Zhou, Bolin Ding, Lei Chen, Jieping Ye, Ke Xu. "Flexible Dynamic Task Assignment in Real-Time Spatial Data", Proceedings of the VLDB Endowment, 10(11): 1334-1345, Munich, Germany, August 28 - September 1, 2017. PDF Slides Code&Data Poster
  6. [SIGKDD 2017] Yongxin Tong, Yuqiang Chen, Zimu Zhou, Lei Chen, Jie Wang, Qiang Yang, Jieping Ye, Weifeng Lv. "The Simpler The Better: A Unified Approach to Predicting Original Taxi Demands on Large-Scale Online Platforms", in Proceedings of the 23rd ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Pages 1653-1662, Halifax, Nova Scotia, Canada, August 13 - 17, 2017. PDF Slides Poster
  7. [APWeb-WAIM 2017] Dawei Gao, Yongxin Tong, Yudian Ji, Ke Xu. "Team-Oriented Task Planning in Spatial Crowdsourcing", in Proceedings of the 1st Asia Pacific Web (APWeb) and Web-Age Information Management (WAIM) Joint Conference on Web and Big Data, Pages 41-56, Beijing, China, July 7 - 9, 2017. PDF
  8. [SIGMOD 2017] Jieying She, Yongxin Tong, Lei Chen, Tianshu Song. "Feedback-Aware Social Event-Participant Arrangement", in Proceedings of the 36th ACM SIGMOD International Conference on Management of Data, Pages 851-865, Chicago, IL, USA, May 14-19, 2017. PDF Slides Poster
  9. [ICDE 2017] Tianshu Song, Yongxin Tong, Libin Wang, Jieying She, Bin Yao, Lei Chen, Ke Xu. "Trichromatic Online Matching in Real-Time Spatial Crowdsourcing", in Proceedings of the 33rd International Conference on Data Engineering, Pages 1009-1020, Diego, California, USA, April 19-22, 2017. PDF Slides Poster

2016

  1. [TOIS 2016] Di Jiang, Yongxin Tong, Yuanfeng Song. "Cross-Lingual Topic Discovery from Multilingual Search Engine Query Log", ACM Transactions on Information Systems, 35(2): 9:1-9:28, December 2016. PDF
  2. [WWWJ 2016] Yongxin Tong, Jieying She, Rui Meng. "Bottleneck-Aware Arrangement over Event-Based Social Networks: The Max-Min Approach", World Wide Web Journal, 19(6): 1151-1177, November 2016. PDF
  3. [TKDE 2016] Jieying She, Yongxin Tong, Lei Chen, Caleb Chen Cao. "Conflict-Aware Event-Participant Arrangement and its Variant for Online Setting", IEEE Transactions on Knowledge and Data Engineering, 28(9): 2281-2295, September 2016. PDF
  4. [WWWJ 2016] Yongxin Tong, Xiaofei Zhang, Lei Chen. "Tracking Frequent Items over Distributed Probabilistic Data", World Wide Web Journal, 19(4): 579-604, July 2016. PDF
  5. [VLDB 2016] Yongxin Tong, Jieying She, Bolin Ding, Lei Chen, Tianyu Wo, Ke Xu. "Online Minimum Matching in Real-Time Spatial Data: Experiments and Analysis", Proceedings of the VLDB Endowment, 9(12): 1053-1064, New Delhi, India, September 5-9, 2016. PDF Slides Poster
  6. [WAIM 2016] Dawei Gao, Yongxin Tong, Jieying She, Tianshu Song, Lei Chen, Ke Xu. "Top-k Teams Recommendation in Spatial Crowdsourcing", in Proceedings of the 17th International Conference on Web-Age Information Management, Pages 191-204, Nanchang, Jiangxi, China, June 3-5, 2016. PDF [Best Paper Award]
  7. [WAIM 2016] Qinyi Wang, Jieying She, Tianshu Song, Yongxin Tong, Lei Chen, Ke Xu. "Adjustable Time-Window-based Event Detection on Twitter", in Proceedings of the 17th International Conference on Web-Age Information Management, Pages 265-278, Nanchang, Jiangxi, China, June 3-5, 2016. PDF
  8. [ICDE 2016] Yongxin Tong, Jieying She, Bolin Ding, Libin Wang, Lei Chen. "Online Mobile Micro-Task Allocation in Spatial Crowdsourcing", in Proceedings of the 32nd International Conference on Data Engineering, Pages 49-60, Helsinki, Finland, May 16-20, 2016. PDF Slides Code&Data Poster

2015

  1. [JCST 2015] Yongxin Tong, Jieying She, Lei Chen. "Towards Better Understanding of App Functions", Journal of Computer Science and Technology, 30(5): 1130-1140, September 2015. PDF
  2. [DAPD 2015] Xiaofei Zhang, Yongxin Tong, Lei Chen, Min Wang, Shicong Feng. "Locality-aware Allocation of Multi-dimensional Correlated Files on the Cloud Platform", Distributed and Parallel Databases: An International Journal, 33(3): 353-380, September 2015. PDF
  3. [JCST 2015] Yongxin Tong, Lei Chen, Jieying She. "Mining Frequent Itemsets in Correlated Uncertain Databases", Journal of Computer Science and Technology, 30(4): 696-712, July 2015. PDF
  4. [ICDM 2015] Rui Meng, Yongxin Tong, Lei Chen, Caleb Chen Cao. "CrowdTC: Crowdsourced Taxonomy Construction", in Proceedings of the 15th International Conference on Data Mining, Pages 913-918, Atlantic City, USA, November 11-14, 2015. PDF
  5. [SIGMOD 2015] Jieying She, Yongxin Tong, Lei Chen. "Utility-aware Social Event-Participant Planning", in Proceedings of the 34th ACM SIGMOD International Conference on Management of Data, Pages 1629-1643, Melbourne, Australia, May 31- June 4, 2015. PDF Slides Poster
  6. [ICDE 2015] Jieying She, Yongxin Tong, Lei Chen, Caleb Chen Cao. "Conflict-Aware Event-Participant Arrangement", in Proceedings of the 31st International Conference on Data Engineering, Pages 735-746, Seoul, Korea, April 13-16, 2015. PDF Slides Poster
  7. [ICDE 2015] Chen Jason Zhang, Lei Chen, Yongxin Tong, Zheng Liu. "Cleaning Uncertain Data with a Noisy Crowd", in Proceedings of the 31st International Conference on Data Engineering, Pages 6-17, Seoul, Korea, April 13-16, 2015. PDF
  8. [SSEPM 2015] Yongxin Tong, Rui Meng, Jieying She. "On Bottleneck-Aware Arrangement for Event-Based Social Networks", in Proceedings of the 1st International Workshop on Scalable Social Event Processing and Management, Pages 735-746, Seoul, Korea, April 13, 2015. (Invited) PDF

2014 and Before

  1. [CIKM 2014] Yongxin Tong, Xiaofei Zhang, Caleb Chen Cao, Lei Chen. "Efficient Probabilistic Supergraph Search over Large Uncertain Graphs", in Proceedings of the 23rd ACM International Conference on Information and Knowledge Management, Pages 809-818, Shanghai, China, November 3-7, 2014. PDF
  2. [CIKM 2014] Chen Jason Zhang, Lei Chen, Yongxin Tong. "MaC: A Probabilistic Framework For Query Answering With Machine-Crowd Collaboration", in Proceedings of the 23rd ACM International Conference on Information and Knowledge Management, Pages 11-20, Shanghai, China, November 3-7, 2014. PDF
  3. [VLDB 2014] Chen Jason Zhang, Yongxin Tong, Lei Chen. "Where To: Crowd-Aided Path Selection", Proceedings of the VLDB Endowment, 7(14): 2005-2016, Hangzhou, China, September 1-5, 2014. PDF
  4. [VLDB 2014] Zhao Chen, Rui Fu, Ziyuan Zhao, Zheng Liu, Leihao Xia, Lei Chen, Peng Cheng, Caleb Chen Cao, Yongxin Tong, Chen Jason Zhang. "gMission: A General Spatial Crowdsourcing Platform" (Demo Paper), Proceedings of the VLDB Endowment, 7(13): 1629-1632, Hangzhou, China, September 1-5, 2014. PDF [Excellent Demonstration Award]
  5. [SIGKDD 2014] Yongxin Tong, Caleb Chen Cao, Lei Chen. "TCS: Efficient Topic Discovery over Crowd-oriented Service Data", in Proceedings of the 19th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Pages 861-870, New York, USA, August 24-27, 2014. PDF
  6. [ICDE 2014] Yongxin Tong, Caleb Chen Cao, Chen Jason Zhang, Yatao Li, Lei Chen. "CrowdCleaner: Data Cleaning for Multi-Version Data on the Web via Crowdsourcing" (Demo Paper), in Proceedings of the 30th International Conference on Data Engineering, Pages 1182-1185, Chicago, IL, USA, March 31-April 4, 2014. PDF Poster
  7. [SIGKDD 2013] Caleb Chen Cao, Yongxin Tong, Lei Chen, H. V. Jagadish. "WiseMarket: A New Paradigm for Managing Wisdom of Online Social Users", in Proceedings of the 19th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Pages 455-463, Chicago, Illinois, August 11-14, 2013. PDF Slides
  8. [ICDE 2013] Xiaofei Zhang, Lei Chen, Yongxin Tong, Min Wang. "EAGRE: Towards Scalable I/O Efficient SPARQL Query Evaluation on the Cloud", in Proceedings of the 29th International Conference on Data Engineering, Pages 565-576, Brisbane, Australia, April 8-11, 2013. PDF
  9. [VLDB 2012] Yongxin Tong, Lei Chen, Yurong Cheng, Philip S. Yu. "Mining Frequent Itemsets over Uncertain Databases", Proceedings of the VLDB Endowment, 5(11): 1650-1661, Istanbul, Turkey, August 27-31, 2012. PDF Slides Executable Code and Datasets
  10. [VLDB 2012] Caleb Chen Cao, Jieying She, Yongxin Tong, Lei Chen. "Whom to Ask? Jury Selection for Decision Making Tasks on Micro-blog Services", Proceedings of the VLDB Endowment, 5(11): 1495-1506, Istanbul, Turkey, August 27-31, 2012. PDF Slides
  11. [SIGKDD 2012] Yongxin Tong, Lei Chen, Philip S. Yu. "UFIMT: An Uncertain Frequent Itemset Mining Toolbox" (Demo Paper), in Proceedings of the 18th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Pages 1508-1511, Beijing, China, August 12-16, 2012. PDF Poster
  12. [ICDE 2012] Yongxin Tong, Lei Chen, Bolin Ding. "Discovering Threshold-Based Frequent Closed Itemsets over Probabilistic Data", in Proceedings of the 28th International Conference on Data Engineering, Pages 270-281, Washington, DC, USA, April 1-5, 2012. PDF Slides
2020
2019
2018
2017
2016
2015
2014 and Before