Professor Xiaohui Yu's Personal Page

Xiaohui Yu, Professor

Software Park Campus, Shandong University, 1500 Shunhua Road, High-Tech Zone, Jinan, Shandong Province
Email: xyu@sdu.edu.cn


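Bachelor's degree from Nanjing University, master's degree from The Chinese University of Hong Kong, and Ph.D. from the University of Toronto, Canada. "Qilu Young Scholar" Distinguished Professor and doctoral supervisor at Shandong University.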

Student Recruitment



Research Interests


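Big data management and analytics. In recent years, fields such as the Internet, transportation, biology, medicine, and astronomy have produced big data characterized by massive volume, high velocity, and great variety. Managing and analyzing such data far exceeds the capabilities of traditional database systems and calls for in-depth research. Research topics include management architectures, storage mechanisms, and query algorithms for big data; processing of massive spatio-temporal data; processing of massive social media data; data mining models and algorithms for big data; key techniques for big data management under the cloud computing paradigm; and the construction of application-specific big data management and analysis systems.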

Courses Taught


Database Systems
Advanced Database Systems

Projects


Projects as Principal Investigator

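National Natural Science Foundation of China (NSFC) General Program: Research on a Real-Time Stream Data Processing Platform and Key Query Processing Techniques for Microblogs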

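National Natural Science Foundation of China (NSFC) General Program: Research on Several Frontier Problems in Keyword Search over Relational Databases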

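Natural Science Foundation of Shandong Province Key Program: Research on Key Cloud Computing Techniques for Massive Spatio-Temporal Data Processing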

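China International ICT Innovation Cluster (CIIIC), Institute of Information and Communication Technology project: GD-MAP, Research and Development of a Pan-Data Management and Analysis Platform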

Managing Risk and Uncertainty in Query Optimization (Discovery Grant, Natural Sciences and Engineering Research Council of Canada)

Multi-Objective Query Optimization (IBM Centre for Advanced Studies Project)

Publications


Recent Publications (2008-)

Ziqiang Yu, Xiaohui Yu, Yang Liu, Ken Q. Pu. Scalable Distributed Processing of K Nearest Neighbor Queries over Moving Objects. Accepted for publication in IEEE Transactions on Knowledge and Data Engineering (TKDE), 2014.

Chong Yang, Xiaohui Yu, Yang Liu. Continuous KNN Join Processing for Real-time Recommendation. To appear in IEEE International Conference on Data Mining (ICDM), December 14-17, 2014.

Meng Chen, Xiaohui Yu, Yang Liu. NLPMM: a Next Location Predictor with Markov Modeling, in Proceedings of the Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD), May 13-16, 2014.

Xinyan Lou, Yang Liu, Xiaohui Yu. Traffic Session Identification Based on Statistical Language Model. In Proceedings of International Conference on Advanced Data Mining and Applications (ADMA), pp. 264-275, December 2013.

Yang Liu, Xiaohui Yu, Bing Liu, Zhongshuai Chen. Sentiment Analysis of Sentences with Modalities, in Proceedings of the International Workshop on Mining Unstructured Big Data using Natural Language Processing (MNLP), co-located with CIKM, October 2013.

Jiaran Zhang, Xiaohui Yu, and Liwei Lin. DDSN: Duplicate Detection to Reduce Both Storage and Bandwidth Consumption, in Proceedings of the 2013 IEEE International Conference on Big Data, October 2013.

M. Kargar, A. An and X. Yu, Efficient Duplication Free and Minimal Keyword Search in Graphs, IEEE Transactions on Knowledge and Data Engineering (TKDE), online May 2013.

L. Lin, X. Yu, N. Koudas. Pollux: Towards Scalable Distributed Real-time Search on Microblogs, in Proceedings of the 16th International Conference on Extending Database Technology, (EDBT 2013), Genoa, Italy, March 18-22, 2013.

Y. Liu, X. Yu, A. An, X. Huang. Riding the Tide of Sentiment Change: Sentiment Analysis with Evolving Online Reviews, World Wide Web, Vol. 16, No. 4, June 2013.

Z. Yu, X. Yu, Y. Liu. Efficient Top-k Keyword Search over MultiDimensional Databases, in International Journal of Data Warehousing and Mining (IJDWM), Vol. 9, No. 3, 2013.

Z. Abul-Basher, Y. Feng, P. Godfrey, X. Yu, M. Kandil, D. Zilio, C. Zuzarte. Alternative Query Optimization for Workload Management, in DEXA, September 3-6, 2012.

X. Yu, H. Shi. CI-Rank: Ranking Keyword Search Results Based on Collective Importance, in Proceedings of the 28th IEEE International Conference on Data Engineering (ICDE 2012), Washington D.C., April 1-5, 2012.

X. Yu, Y. Liu, X. Huang, A. An. Mining Online Reviews for Predicting Sales Performance: A Case Study in the Movie Domain, in IEEE Transactions on Knowledge and Data Engineering (TKDE), April 2012.

Y. Liu, X. Yu, X. Huang, A. An. Combining integrated sampling with SVM ensembles for learning from imbalanced datasets, Inf. Process. Manage. 47(4): 617-631 (2011)

X. Yu, J. Dong. Indexing High-Dimensional Data for Main-Memory Similarity Search, in Information Systems 35 (2010), pp. 825-843, Elsevier, November 2010. DOI:10.1016/j.is.2010.05.001

Y. Liu, X. Yu, X. Huang, A. An, S-PLSA+: Adaptive Sentiment Analysis with Application to Sales Performance Prediction, to appear in Proceedings of SIGIR 2010, July 19-23, 2010, Geneva, Switzerland. (poster)

X. Yu, Y. Liu, X. Huang, A. An. A Quality-Aware Model for Sales Prediction Using Reviews, in Proceedings of the 19th International World Wide Web Conference (WWW 2010), Raleigh, North Carolina, April 26-30, 2010. (poster)

X. Yu, H. Shi. Query Segmentation Using Conditional Random Fields, in Proceedings of the First International Workshop on Keyword Search on Structured Data (KEYS 2009), co-located with SIGMOD 2009, Providence, RI, June 28, 2009.

K. Pu, X. Yu. FRISK: Query Cleaning and Processing in Action, in Proceedings of 25th International Conference on Data Engineering (ICDE 2009), Shanghai, China, March 29-April 4, 2009.

Y. Liu, X. Huang, A. An, and X. Yu. Predicting the Helpfulness of Online Reviews, in Proceedings of 8th IEEE International Conference on Data Mining (ICDM 2008), Pisa, December, 2008.

Y. Liu, X. Huang, A. An, and X. Yu. HelpMeter: A Nonlinear Model for Predicting the Helpfulness of Online Reviews, in Proceedings of 2008 IEEE/WIC/ACM International Conference on Web Intelligence (WI 2008), Sydney, December, 2008.

Y. Liu, X. Yu, X. Huang, A. An. Blog Data Mining: the Predictive Power of Sentiments, a chapter in L. Cao, P.S. Yu, C. Zhang, H. Zhang (eds.): Data Mining for Business Applications, Springer. 2008.

K. Pu, X. Yu. Keyword Query Cleaning, in the 34th International Conference on Very Large Data Bases (VLDB 2008), Auckland, New Zealand, August 2008.

M. Hadjieleftheriou, X. Yu, N. Koudas, D. Srivastava. Selectivity Estimation of Set Similarity Selection Queries, in the 34th International Conference on Very Large Data Bases (VLDB 2008), Auckland, New Zealand, August 2008.

Patents
1. Apparatus, system, and method for performing fast approximate computation of statistics on query expressions, U.S. Patent No. US 7593931 B2
2. Method to estimate the number of distinct value combinations for a set of attributes in a database system, U.S. Patent No. US 8572067 B2
3. Selectivity estimation of set similarity selection queries, U.S. Patent No. US 8161046 B2

Fields of Work of My Graduate Students