报告题目:Apache AsterixDB: a Big Data Management System for Large-Scale Data Storage, Indexing, and Analytics
报告日期及时间: 2016年9月18日 10:00 AM
报告地点:B403
报告人: Professor Chen Li
报告人单位: UC Irvine
报告人简介: Chen Li is a professor in the Department of Computer Science at UC Irvine. He received his Ph.D. degree in Computer Science from Stanford University, and his M.S. and B.S. in Computer Science from Tsinghua University, China, respectively. His research interests are in the field of data management, including data cleaning, data integration, data-intensive computing, and text analytics. He was a recipient of an NSF CAREER Award, several test-of-time publication awards, and many other grants and industry gifts. He was once a part-time Visiting Research Scientist at Google. He founded a company SRCH2 to develop an open source search engine with high performance and advanced features. He is now a Senior Member of ACM and selected as the PC member of the Database Conferences in recent years, including VLDB, SIGMOD/PODS, ICDE, et al.
报告摘要: We will present Apache AsterixDB, a full-function Big Data Management System (BDMS) with a rich feature set that distinguishes it from the other platforms in today's open source Big Data software ecosystem. This feature set makes it ideally-suited to current needs including web data warehousing, social data storage and analysis, and a variety of other use cases related to "Big Data problems". This talk will provide a technical overview of the system, including its semi-structured NoSQL-style data model, a declarative query language, a parallel runtime engine called Hyracks, partitioned and LSM-based data storage, various indexing types (B+ tree, R tree, and inverted index), and support for traditional queries as well as spatial, temporal, and fuzzy queries. We will also report initial results on using AsterixDB to support large-scale data analytics and visualization with a live demo.
邀请人: 何炎祥教授,李文海副教授