Publications

    Book

    Managing and Mining Graph Data
    Springer Ed., 2010

    Charu Aggarwal, Haixun Wang

    Comprehensive survey driven book on graph data with chapters contributed by prominent researchers in the field.


    Table of Contents

    2011

  1. On Dimensionality Reduction of Massive Graphs for Indexing and Retrieval, by Charu Aggarwal and Haixun Wang, in the the 26th International Conference on Data Engineering (ICDE), 2011, Hannover, Germany.

  2. A Unified Approach for Computing Top-k Pairs in Multidimensional Space, by M. A. Cheema, Xuemin Lin, Haixun Wang, Jianmin Wang, and Wenjie Zhang, in the the 26th International Conference on Data Engineering (ICDE), 2011, Hannover, Germany.

    2010

  3. Optimizing Content Freshness of Relations Extracted From the Web Using Keyword Search, by Mohan Yang, Haixun Wang, Lipyeow Lim, and Min Wang, in the ACM International Conference on Management of Data (SIGMOD), 2010, Indianapolis, USA.

  4. Computing Label Constraint Reachability in Graph Databases, by Ruomin Jin, Hui Hong, Haixun Wang, Yang Xiang, and Ning Ruan, in the ACM International Conference on Management of Data (SIGMOD), 2010, Indianapolis, USA.

  5. An Algorithmic Approach to Event Summarization, by Peng Wang, Haixun Wang, and Wei Wang, in the ACM International Conference on Management of Data (SIGMOD), 2010, Indianapolis, USA.

  6. Leveraging Spatio-Temporal Redundancy for RFID Data Cleansing, by Haiquan Chen, Jeff Ku, Haixun Wang, and Min-Te Sun, in the ACM International Conference on Management of Data (SIGMOD), 2010, Indianapolis, USA.

  7. MapDupReducer: Detecting Near Duplicates over Massive Datasets, by Chaokun Wang, Jianmin Wang, Xuemin Lin, Haixun Wang, and Hongsong Li, in the ACM International Conference on Management of Data (SIGMOD), 2010, Indianapolis, USA. (demo)

  8. Incorporating Post-Click Behaviors into a Click Model, by Gang Wang, Feimin Zhong, Dong Wang, Zheng Chen, and Haixun Wang, in the ACM International Conference on Information Retrieval (SIGIR), 2010, Geneva, Switzerland.

  9. Adaptive Runtime Anomaly Prediction for Dynamic Hosting Infrastructures, by Yongmin Tan, Xiaohui Gu, and Haixun Wang, in the ACM Symposium on Principles of Distributed Computing (PODC), 2010, Switzerland.

  10. Managing and Mining Graph Data, by Charu Aggarwal and Haixun Wang, in the Springer, ISBN 1441960449, Feb 19 2010, first edition. (book)

  11. Cleansing Uncertain Databases Leveraging Aggregate Constraints, by Haiquan Chen, Jeff Ku, and Haixun Wang, in the 2nd Workshop on Management and mining Of UNcertain Data (MOUND), 2010, Long Beach, CA.

    2009

  12. Inverse Time Dependency in Convex Regularized Learning, by Zeyuan Zhu, Weizhu Chen, Gang Wang, and Haixun Wang, in the 9th IEEE International Conference on Data Mining (ICDM), 2009, Miami, USA. (Best Student Paper Runner Up)

  13. Learning to Rank with a Novel Kernel Perceptron Method, by Xue-wen Chen, Haixun Wang, and Xiaotong Lin, in the ACM 18th Conference on Information and Knowledge Management (CIKM), November 2009, Hong Kong, China.

  14. Semantic Query by Example, by Lipyeow Lim, Haixun Wang, and Min Wang, in the ACM 18th Conference on Information and Knowledge Management (CIKM), November 2009, Hong Kong, China.

  15. Query Integrity Assurance of Location-based Services Accessing Outsourced Spatial Databases, by Wei-Shinn Ku, Ling Hu, Cyrus Shahabi, and Haixun Wang, in the 11th International Symposium on Spatial and Temporal Databases, 2009, Aalborg, Denmark.

  16. An Integrated Data-Driven Framework for Computing System Management, by Tao Li, Wei Peng, Chang-shing Perng, Sheng Ma, and Haixun Wang, in the IEEE Transactions on Systems, Man, and Cybernetics - part A., 2009, .

  17. Concept Clustering for Evolving Data, by Shixi Chen, Haixun Wang, and Shuigeng Zhou, in the 24th International Conference on Data Engineering (ICDE), March 2009, Shanghai, China. (short paper)

  18. Weighted Proximity Best-Joins for Information Retrieval, by Risi Thonang, Hao He, AnHai Doan, Haixun Wang, and Jun Yang, in the 24th International Conference on Data Engineering (ICDE), March 2009, Shanghai, China. (full paper)

  19. Online Anomaly Prediction for Robust Cluster Systems, by Xiaohui Gu and Haixun Wang, in the 24th International Conference on Data Engineering (ICDE), March 2009, Shanghai, China. (full paper)

    2008

  20. Modeling and Querying E-Commerce Data in Hybrid Relational-XML DBMSs, by Lipyeow Lim, Haixun Wang, and Min Wang, in the 27th International Conference on Conceptual Modeling (ER), October 2008, Barcelona, Spain. (Best Paper Award)

  21. Time-Stamp Management and Query Execution in Data Stream Management Systems, by Yijian Bai, Hetal Thakkar, Haixun Wang, and Carlo Zaniolo, in the Journal of IEEE Internet Computing, December 2008, Vol. 12, No. 6, pages 13-21.

  22. Dual Encryption for Query Integrity Assurance, by Haixun Wang, Jian Yin, Chang-shing Perng, and Philip S. Yu, in the ACM 17th Conference on Information and Knowledge Management (CIKM), October 2008, Napa Valley, California.

  23. Clustering by Pattern Similarity, by Haixun Wang and Jian Pei, in the Journal of Computer Science and Technology (JCST), 2008, Vol. 23, No. 4, pages 481-496.

  24. Efficiently Answering Reachability Query on Very Large Directed Graphs, by Ruoming Jin, Yang Xiang, Ning Ruan, and Haixun Wang, in the ACM International Conference on Management of Data (SIGMOD), June 2008, Vancouver, Canada.

  25. Lock-Free Consistency Control for Web 2.0 Applications, by Jiangming Yang, Haixun Wang, Ning Gu, Yiming Liu, Chunsong Wang, and Qiwei Zhang, in the 17th International World Wide Web Conference (WWW), April 2008, Beijing, China.

  26. Location-based Spatial Query Processing in Wireless Broadcast Environment, by Wei-Shinn Ku, Roger Zimmermann, and Haixun Wang, in the IEEE Transactions on Mobile Computing (TMC), 2008, Vol. 7, No. 1.

  27. Fast Computing Reachability Labelings for Large Graphs with High Compression Rate, by Jiefeng Cheng, Jeffrey Xu Yu, Xuemin Lin, Haixun Wang, and Philip S. Yu, in the 11th International Conference on Extending Database Technology (EDBT), March 2008, Nantes, France.

  28. Providing Freshness Guarantees for Outsourced Databases, by Min Xie, Haixun Wang, Jian Yin, and Xiaofeng Meng, in the 11th International Conference on Extending Database Technology (EDBT), March 2008, Nantes, France.

  29. Stop Chasing Trends: Discovering High Order Models in Evolving Data, by Shixi Chen, Haixun Wang, Shuigeng Zhou, and Philip S. Yu, in the 23rd International Conference on Data Engineering (ICDE), April 2008, Cancun, Mexico. (full paper)

  30. Fast Graph Pattern Matching, by Jiefeng Cheng, Jeffrey Xu Yu, Bolin Ding, Philip S. Yu, and Haixun Wang, in the 23rd International Conference on Data Engineering (ICDE), April 2008, Cancun, Mexico. (full paper)

  31. A Sampling-Based Approach to Information Recovery, by Junyi Xie, Jun Yang, Yuguo Chen, Haixun Wang, and Philip S. Yu, in the 23rd International Conference on Data Engineering (ICDE), April 2008, Cancun, Mexico. (full paper)

    2007

  32. Unifying Data and Domain Knowledge Using Virtual Views, by Lipyeow Lim, Haixun Wang, and Min Wang, in the 33rd International Conference on Very Large Data Bases (VLDB), September 2007, Vienna, Austria.

  33. Integrity Auditing of Outsourced Data, by Min Xie, Haixun Wang, Jian Yin, and Xiaofeng Meng, in the 33rd International Conference on Very Large Data Bases (VLDB), September 2007, Vienna, Austria.

  34. Challenges and Experience in Prototyping a Multi-Modal Stream Analytic and Monitoring Application on System S (industry paper), by Kun-Lung Wu, Philip S. Yu, Bugra Gedik, Kristen Hildrum, Charu Aggarwal, Eric Bouillet, Wei Fan, David George, Xiaohui Gu, Gang Luo, and Haixun Wang, in the 33rd International Conference on Very Large Data Bases (VLDB), September 2007, Vienna, Austria. industry track paper

  35. Event Summarization for System Management, by Wei Peng, Charles Perng, Tao Li, and Haixun Wang, in the ACM Int'l Conf. on Knowledge Discovery and Data Mining (SIGKDD), August 2007, San Jose, USA.

  36. BLINKS: Ranked Keyword Searches on Graphs, by Hao He, Haixun Wang, Jun Yang, and Philip Yu, in the ACM International Conference on Management of Data (SIGMOD), June 2007, Beijing, China.

  37. Supporting Ranking and Clustering as Generalized Order-By and Group-By, by Chengkai Li, Min Wang, Lipyeow Lim, Haixun Wang, and Kevin Chang, in the ACM International Conference on Management of Data (SIGMOD), June 2007, Beijing, China.

  38. Load Shedding in Classifying Multi-Source Streaming Data: A Bayes Risk Approach, by Yijian Bai, Haixun Wang, and Carlo Zaniolo, in the 6th SIAM International Conference on Data Mining (SDM), April 2007, Minnesota, USA.

  39. Adaptive Load Diffusion for Multiway Windowed Stream Joins, by Xiaohui Gu, Philip S. Yu, and Haixun Wang, in the 22nd International Conference on Data Engineering (ICDE), April 2007, Istanbul, Turkey. (full paper)

  40. Computing Compressed Multidimensional Skyline Cubes Efficiently, by Jian Pei, Ada Wai-Chee Fu, Xuemin Lin, and Haixun Wang, in the 22nd International Conference on Data Engineering (ICDE), April 2007, Istanbul, Turkey. (full paper)

  41. GString: A Novel Approach for Efficient Search in Graph Databases, by Haoliang Jiang, Haixun Wang, Shuigeng Zhou, and Philip S. Yu, in the 22nd International Conference on Data Engineering (ICDE), April 2007, Istanbul, Turkey. (full paper)

  42. Location-based Spatial Queries with Data Sharing in Wireless Broadcast Environments (short paper), by Wei-Shinn Ku, Roger Zimmermann, and Haixun Wang, in the 22nd International Conference on Data Engineering (ICDE), April 2007, Istanbul, Turkey.

  43. Optimizing Timestamp Management in Data Stream Management Systems (short paper), by Yijian Bai, Hetal Thakkar, Haixun Wang, and Carlo Zaniolo, in the 22nd International Conference on Data Engineering (ICDE), April 2007, Istanbul, Turkey.

  44. Semantic Data Management: Towards Querying Data with their Meaning (short paper), by Lipyeow Lim, Haixun Wang, and Min Wang, in the 22nd International Conference on Data Engineering (ICDE), April 2007, Istanbul, Turkey.

  45. A Flexible Query Graph Based Model for the Efficient Execution of Continuous Queries, by Yijian Bai, Hetal Thakkar, Haixun Wang, and Carlo Zaniolo, in the First International Workshop on Scalable Stream Processing Systems (SSPS), April 2007, Istanbul, Turkey.

  46. A Low Granularity Classifier for Data Streams with Concept Drifts and Biased Class Distribution, by Peng Wang, Haixun Wang, Xiaochen Wu, Wei Wang, and Baile Shi, in the IEEE Transactions on Knowledge and Data Engineering (TKDE), 2007, Vol. 19, No. 9.

    2006

  47. LOCI: Load Shedding through Class-Preserving Data Acquisition, by Peng Wang, Haixun Wang, Wei Wang, Baile Shi, and Philip S. Yu, in the 6th IEEE International Conference on Data Mining (ICDM), December 2006, Hong Kong, China. (full paper)

  48. Fast Relevance Discovery in Time Series (short paper), by Chang-shing Perng, Haixun Wang, and Sheng Ma, in the 6th IEEE International Conference on Data Mining (ICDM), December 2006, Hong Kong, China.

  49. A Balanced Ensemble Approach to Weighting Classifiers for Text Classification (short paper), by Gabriel Pui Cheong Fung, Jeffrey Yu, Haixun Wang, Huan Liu, and David W Cheung, in the 6th IEEE International Conference on Data Mining (ICDM), December 2006, Hong Kong, China.

  50. Predictive Learning on Data Streams (tutorial), by Haixun Wang and Ying Yang, in the 6th IEEE International Conference on Data Mining (ICDM), December 2006, Hong Kong, China.

  51. A Data Stream Language and System Designed for Power and Extensibility, by Yijian Bai, Hetal Thakkar, Richard Luo, Haixun Wang, and Carlo Zaniolo, in the ACM Conference on Information and Knowledge Management (CIKM), November 2006, Arlington, USA.

  52. Discovering Frequent Closed Partial Orders from Strings, by Jian Pei, Haixun Wang, Jian Liu, Ke Wang, Jianyong Wang, and Philip S. Yu, in the IEEE Transactions on Knowledge and Data Engineering (TKDE), 2006, Vol. 18, Iss. 11, Page(s): 1467 - 1481.

  53. Suppressing Model Overfitting in Mining Concept-Drifting Data Streams, by Haixun Wang, Jian Yin, Jian Pei, Philip S. Yu, and Jeffrey X. Yu, in the 12th ACM International Conference on Knowledge Discovery and Data Mining (SIGKDD), August 2006, Philadelphia, USA.

  54. Finding Global Icebergs over Distributed Data Sets, by Qi Zhao, Mitsunori Ogihara, Haixun Wang, and Jun Xu, in the 25th ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems (PODS), June 2006, Chicago.

  55. Dual Labeling: Answering Graph Reachability Queries in Constant Time, by Haixun Wang, Hao He, Jun Yang, Philip Yu, and Jeffrey Xu Yu, in the 21st International Conference on Data Engineering (ICDE), April 2006, Atlanta, USA. (full paper)

  56. Fast Computation of Reachability Labeling for Large Graphs, by Jeffrey Xu Yu, Jiefeng Cheng, Xuemin Lin, Haixun Wang, and Philip Yu, in the 10th International Conference on Extending Database Technology (EDBT), Mar 2006, Munich, Germany.

  57. Catch the moment: maintaining closed frequent itemsets over a data stream sliding window, by Yun Chi, Haixun Wang, Philip S. Yu, and Richard R. Muntz, in the Knowledge Information Systtems, 2006, 10(3): 265-294.

  58. On Reducing Classifier Granularity in Mining Concept-Drifting Data Streams, by Peng Wang, Haixun Wang, Xiaochen Wu, Wei Wang, and Baile Shi, in the 5th IEEE International Conference on Data Mining (ICDM), November 2006, New Orleans, Louisiana, USA. (full paper, accept ratio: 11%)

  59. Effeciently Mining Frequent Closed Partial Orders, by Jian Pei, Jian Liu, Haixun Wang, Ke Wang, Philip S. Yu, and Jianyong Wang, in the 5th IEEE International Conference on Data Mining (ICDM), November 2006, New Orleans, Louisiana, USA. (short paper, accept ratio: 22%)

    2005

  60. An Improved Biclustering Method for Analyzing Gene Expression Profiles, by Jiong Yang, Haixun Wang, Wei Wang, and Philip S. Yu, in the International Journal on Artificial Intelligence Tools, 2005, Vol. 14, No. 5.

  61. Compact Reachability Labeling for Graph-Structured Data, by Hao He, Haixun Wang, Jun Yang, and Philip S. Yu, in the 14th ACM Conference on Information and Knowledge Management (CIKM), October 2005, Bremen, Germany. (accept ratio: 18%)

  62. A Random Method for Quantifying Changing Distributions in Data Streams, by Haixun Wang and Jian Pei, in the 9th European Conference on Principles and Practice of Knowledge Discovery in Databases (PKDD), October 2005, Porto, Portugal.

  63. Pattern-based Similarity Search for Microarray Data, by Haixun Wang, Philip S. Yu, and Jian Pei, in the 11th ACM International Conference on Knowledge Discovery and Data Mining (SIGKDD), August 2005, Chicago.

  64. Preference-based Frequent Pattern Mining, by Moonjung Cho, Jian Pei, Haixun Wang, and Wei Wang, in the International Journal of Data Warehousing and Mining, 2005, Vol. 1, No. 4.

  65. Demand-driven Frequent Itemset Mining Using Pattern Structures, by Haixun Wang, Chang-Shing Perng, Sheng Ma, and Philip S. Yu, in the Knowledge and Information Systems, 2005, Vol. 8, No. 1, pages 85-102.

  66. Loadstar: Load Shedding in Data Stream Mining (demo), by Yun Chi, Haixun Wang, and Philip S. Yu, in the 31th International Conference on Very Large Data Bases (VLDB), August 2005, Trondheim, Norway.

  67. A Native Extension of SQL for Mining Data Streams (demo), by Chang Luo, Haixun Wang, and Carlo Zaniolo, in the ACM International Conference on Management of Data (SIGMOD), June 2005, Baltimore, Maryland.

  68. On the Sequencing of Tree Structures for XML Indexing, by Haixun Wang and Xiaofeng Meng, in the 21st International Conference on Data Engineering (ICDE), April 2005, Tokyo, Japan. (full paper, accept ratio: 12%)

  69. Online Mining of Data Streams: Problems, Applications and Progress (tutorial), by Haixun Wang, Jian Pei, and Philip S. Yu, in the 21st International Conference on Data Engineering (ICDE), April 2005, Tokyo, Japan.

  70. Near-Neighbor Search in Pattern Distance Spaces, by Haixun Wang, Chang-Shing Perng, and Philip S. Yu, in the 4th SIAM International Conference on Data Mining (SDM), April 2005, Newport Beach, USA.

  71. Loadstar: A Load Shedding Scheme for Classifying Data Streams, by Yun Chi, Philip S. Yu, Haixun Wang, and Richard Muntz, in the 4th SIAM International Conference on Data Mining (SDM), April 2005, Newport Beach, USA.

  72. Sequential Risk Management in E-Business by Reinforcement Learning, by Naoki Abe, Edwin Pednault, Bianca Zadrozny, Haixun Wang, Wei Fan, Chid Apte, and (Book Chapter), in the Handbook of Integrated Risk Management for E-Business (A. Labbi, Ed.), 2005, ISBN 1-932159-07-X, J. Ross Publishing, Inc.

  73. Stay Current and Relevant in Data Mining Research (panel), by Haixun Wang and Wei Wang, in the 10th International Conference on Database Systems for Advanced Applications (DASFAA), April 2005, Beijing, China.

  74. Mining Data Streams, by Haixun Wang, Philip S. Yu, and Jiawei Han, in the The Data Mining and Knowledge Discovery Handbook, 2005, p777-792.

    2004

  75. Moment: Maintaining Closed Frequent Itemsets over a Stream Sliding Window, by Yun Chi, Haixun Wang, Philip Yu, and Richard Muntz, in the 4th IEEE International Conference on Data Mining (ICDM), November 2004, Brighton, UK.

  76. Estimating the Selectivity of XML Path Expression with Predicates by Histograms, by Yu Wang, Haixun Wang, Xiaofeng Meng, and Shan Wang, in the 5th International Conference on Web-Age Information Management (WAIM), July 2004, Dalian, China.

  77. Online Mining Data Streams: Problems, Applications and Progress (tutorial), by Jian Pei, Haixun Wang, and Philip S. Yu, in the 10th ACM International Conference on Knowledge Discovery and Data Mining (SIGKDD), August 2004, Seattle, Washington.

  78. Query Languages and Data Models for Database Sequences and Data Streams, by Yan-Nei Law, Haixun Wang, and Carlo Zaniolo, in the 30th International Conference on Very Large Data Bases (VLDB), Sept 2004, Toronto, Canada.

  79. A Fast Algorithm for Subspace Clustering by Pattern Similarity, by Haixun Wang, Fang Chu, Wei Fan, Philip S Yu, and Jian Pei, in the 16th International Conference on Scientific and Statistical Database Management (SSDBM), June 2004, Santorini Island, Greece.

  80. XSeq: An Index Infrastructure for Tree Pattern Queries (demo), by Xiaofeng Meng, Yu Jiang, Yan Chen, and Haixun Wang, in the ACM International Conference on Management of Data (SIGMOD), June 2004, Paris, France.

  81. Active Mining of Data Streams, by Wei Fan, Yi-an Huang, Haixun Wang, and Philip S Yu, in the 3rd SIAM International Conference on Data Mining (SDM), April 2004, Florida, USA.

  82. Mining Extremely Skewed Security Trading Anomalies, by Wei Fan, Philip S Yu, and Haixun Wang, in the 9th International Conference on Extending Database Technology (EDBT), 2004, Crete, Greece.

  83. Toward Extensible Spatio-Temporal Databases: an approach based on User-Defined Aggregates, by Cindy Chen, Haixun Wang, and Carlo Zaniolo, in the "Flexible querying and reasoning in spatio-temporal databases: theory and applications, 2004, Springer Geosciences/Geoinformation series.

    2003

  84. ViST: A Dynamic Index Method for Querying XML Data by Tree Structures, by Haixun Wang, Sanghyun Park, Wei Fan, and Philip S Yu, in the ACM International Conference on Management of Data (SIGMOD), June 2003, San Diego, California, USA. ("http://wis.cs.ucla.edu/%7Ehxwang/system.html"

  85. Mining Concept-Drifting Data Streams using Ensemble Classifiers, by Haixun Wang, Wei Fan, Philip S Yu, and Jiawei Han, in the 9th ACM International Conference on Knowledge Discovery and Data Mining (SIGKDD), August 2003, Washington DC, USA. (full paper)

  86. Indexing Weighted Sequences in Large Databases, by Haixun Wang, Chang-Shing Perng, Wei Fan, Sanghyun Park, and Philip S Yu, in the IEEE International Conference on Data Engineering (ICDE), 2003, Bangalore, India. (full paper)

  87. ATLaS: a Small but Complete SQL Extension for Data Mining and Data Streams (demo), by Haixun Wang, Carlo Zaniolo, and Chang R. Luo, in the 29th International Conference on Very Large Data Bases (VLDB), September 2003, Berlin, Germany.

  88. ATLaS: A Native Extension of SQL for Data Mining, by Haixun Wang and Carlo Zaniolo, in the Second SIAM International Conference on Data Mining (SDM), 2003, San Francisco.

  89. Inductive Learning in Less Than One Sequential Scan, by Wei Fan, Haixun Wang, Philip S Yu, and Shaw-hwa Lo, in the 18th International Joint Conference on Artificial Intelligence (IJCAI), August 2003, Acapulco, Mexico.

  90. Is random model better? Its accuracy and efficiency, by Wei Fan, Haixun Wang, Philip S Yu, and Sheng Ma, in the 3rd IEEE International Conference on Data Mining (ICDM), November 2003, Florida, USA. (full paper)

  91. MaPle: A Fast Algorithm for Maximal Pattern-based Clustering, by Jian Pei, Xiaoling Zhang, Moonjung Cho, Haixun Wang, and Philip S. Yu, in the 3rd IEEE International Conference on Data Mining (ICDM), November 2003, Florida, USA. (full paper)

  92. Incompleteness of Database Languages for Data Streams and Data Mining, by Carlo Zaniolo, Chang R. Luo, Yan N. Law, and Haixun Wang, in the Invited talk for 11th Italian Symposium on Advanced Database Systems (SEBD), June 2003, Cetraro.

  93. Online mining of changes from data streams: Research problems and preliminary results, by Guozhu Dong, Jiawei Han, Laks V. S. Lakshmanan, Jian Pei, Haixun Wang, and Philip S. Yu, in the ACM SIGMOD Workshop on Management and Processing of Data Streams (MPDS), 2003, San Diego, California, USA.

  94. Enhanced Biclustering on Gene Expression data, by Jiong Yang, Haixun Wang, Wei Wang, and Philip S. Yu, in the 3rd IEEE Symposium on Bioinformatics and Bioengineering (BIBE), 2003, Washington DC.

  95. The Deductive Database System LDL++, by Natraj Arni, KayLiang Ong, Shalom Tsur, Haixun Wang, and Carlo Zaniolo, in the Theory and Practice of Logic Programming, 2003, 3(1):61-94.

    2002

  96. Clustering by Pattern Similarity in Large Data Sets, by Haixun Wang, Wei Wang, Jiong Yang, and Philip S. Yu, in the ACM International Conference on Management of Data (SIGMOD), June 2002, Madison, Wisconsin, USA. ("http://wis.cs.ucla.edu/%7Ehxwang/system.html"

  97. A Framework for Scalable Cost-sensitive Learning Based on Combining Probabilities and Benefits, by Wei Fan, Haixun Wang, Philip S. Yu, and Salvatore Stolfo, in the Second SIAM International Conference on Data Mining (SDM), April 2002, Arlington, USA.

  98. ATLaS: a Powerful Database Language and System Based on Simple Extensions of SQL (short paper), by Haixun Wang and Carlo Zaniolo, in the 18th International Conference on Data Engineering (ICDE), 2002, San Jose, USA.

  99. Delta-Cluster: Capturing Subspace Correlation in a Large Data Set, by Jiong Yang, Wei Wang, Haixun Wang, and Philip S. Yu, in the 18th International Conference on Data Engineering (ICDE), 2002, San Jose, USA. (full paper)

  100. Mining Associations by Pattern Structure in Large Relational Tables, by Haixun Wang, Chang-Shing Perng, Sheng Ma, and Philip Yu, in the 2nd IEEE International Conference on Data Mining (ICDM), 2002, Maebashi, Japan.

  101. Empirical Comparison of Various Reinforcement Learning Strategies for Sequential Targeted Marketing, by Naoki Abe, Edwin Pednault, Haixun Wang, Bianca Zadrozny, Wei Fan, and Chidanand Apte, in the 2nd IEEE International Conference on Data Mining (ICDM), 2002, Maebashi, Japan.

  102. Progressive Modeling, by Wei Fan, Haixun Wang, Philip S. Yu, Shaw-hwa Lo, and Salvatore J. Stolfo, in the 2nd IEEE International Conference on Data Mining (ICDM), 2002, Maebashi, Japan.

  103. User-directed Exploration of Mining Space with Multiple Attributes, by Chang-Shing Perng, Haixun Wang, Sheng Ma, and Joseph Hellerstein, in the 2nd IEEE International Conference on Data Mining (ICDM), 2002, Maebashi, Japan.

  104. Sequential Cost-Sensitive Decision Making with Reinforcement Learning, by Edwin Pednault, Wei Fan, Haixun Wang, Naoki Abe, Bianca Zadrozny, and Chidanand Apte, in the 8th ACM SIGKDD International Conference on Data Mining, July 2002, Edmonton, Canada. (full paper)

  105. An Indexing Structure for Similarity Searching in Microarray Data, by Haixun Wang, Charles Perng, Wei Fan, and Philip S. Yu, in the Proceedings of the First IEEE Computer Society Bioinformatics Conference (CSB 2002), August 2002, Palo Alto, California, USA.

  106. Extending SQL for Decision Support Applications, by Haixun Wang and Carlo Zaniolo, in the Design and Management of Data Warehouses (DMDW 2002), May 2002, . (Keynote Address by Carlo Zaniolo)

  107. User-directed Discovery of Patterns in Multi-attribute Data, by Charles Perng, Haixun Wang, Sheng Ma, and Joe Hellerstein, in the KDD Explorations, 2002, Vol. 4, Iss. 1.

  108. Improving performance of bicluster discovery in a large data set, by Jiong Yang, Wei Wang, Haixun Wang, and Philip S. Yu, in the Proceedings of the 6th ACM International Conference on Research in Computational Molecular Biology (RECOMB), 2002, . (poster)

  109. Pruning and dynamic scheduling of cost-sensitive ensembles, by Wei Fan, Fang Chu, Haixun Wang, and Philip S. Yu, in the 18th National Conference on Artificial Intelligence (AAAI-02), July 2002, Edmonton, Canada.

  110. A Fully Distributed Framework for Cost-sensitive Data Mining, by Wei Fan, Haixun Wang, Philip S. Yu, and Salvatore Stolfo, in the the 22nd International Conference on Distributed Computing Systems (ICDCS), July 2002, Vienna, Austria.

    2001

  111. SSDT: A Scalable Subspace-Splitting Classifier for Biased Data, by Haixun Wang and Philip S. Yu, in the First IEEE International Conference on Data Mining (ICDM), 2001, San Jose, California.

  112. FARM: A Framework for Exploring Mining Spaces with Multiple Attributes, by Charles Perng, Haixun Wang, Sheng Ma, and Joe Hellerstein, in the First IEEE International Conference on Data Mining (ICDM), 2001, San Jose, California.

  113. The S2-Tree: An Index Structure for Subsequence Matching of Spatial Objects, by Haixun Wang and Charles Perng, in the 5th Pacific-Asic Conference on Knowledge Discovery and Data Mining (PAKDD), April 2001, Hong Kong.

    2000

  114. Using SQL to Build New Aggregates and Extenders for Object-Relational Systems, by Haixun Wang and Carlo Zaniolo, in the Proc. 26th Intl. Conf. on Very Large Databases (VLDB), September 2000, Cairo, Egypt.

  115. Database System Extensions for Decision Support: the AXL Approach, by Haixun Wang and Carlo Zaniolo, in the ACM SIGMOD Workshop on Research Issues in Data Mining and Knowledge Discovery (DMKD), May 2000, Dallas, TX.

  116. CMP: A Fast Decision Tree Classifier Using Multivariate Predictions, by Haixun Wang and Carlo Zaniolo, in the 16th International Conference on Data Engineering (ICDE), 2000, San Diego, USA. (full paper)

  117. User Defined Aggregates in Object-Relational Systems, by Haixun Wang and Carlo Zaniolo, in the 16th International Conference on Data Engineering (ICDE), 2000, San Diego, USA. (full paper)

  118. Landmarks: a New Model for Similarity-based Pattern Querying in Time Series Databases, by Chang-Shing Perng, Haixun Wang, Sylvia R. Zhang, and D. Stott Parker, in the 16th International Conference on Data Engineering (ICDE), 2000, San Diego, USA. (full paper)

  119. Nonmonotonic Reasoning in LDL++: A Second-Generation Deductive Database System, by Haixun Wang and Carlo Zaniolo, in the (Book Chapter) Logic-Based Artificial Intelligence, (J. Minker, Ed.), 2000, .

    1999

  120. User Defined Aggregates in Database Languages, by Haixun Wang and Carlo Zaniolo, in the Seventh International Workshop on Database Programming Languages, September 1999, Scotland.

  121. User-Defined Aggregates for Datamining, by Haixun Wang and Carlo Zaniolo, in the ACM SIGMOD Workshop on Research Issues in Data Mining and Knowledge Discovery (DMKD), May 1999, Philadelphia, USA.

    1998

  122. User-Defined Aggregates for Logical Data Languages, by Haixun Wang and Carlo Zaniolo, in the The Sixth International Workshop on Deductive Databases and Logic Programming, 1998, Manchester, UK.

  123. Logic-Based User-Defined Aggregates for the Next Generation of Database Systems, by Carlo Zaniolo and Haixun Wang, in the (Book Chapter) The Logic Programming Paradigm: Current Trends and Future Directions, K.R. Apt, V. Marek, M. Truszczynski, D.S.Warren (eds.), 1998, Springer Verlag.

    1996

  124. An Information Retrieval Algorithm for Database Applications (in Chinese), by Haixun Wang, Jinyuan You, Zhou Wang, and Kan Wang, in the Computer Engineering, 1996, Vol. 22, No.3.