Main Page

Table of Contents

Author Index

Co-Located Workshops

ACM SIGKDD Membership

KDD 2013 Conference Proceedings

Inderjit S. Dhillon, Yehuda Koren, Rayid Ghani, Ted E. Senator, Paul Bradley, Rajesh Parekh, Jingrui He, Robert L. Grossman, & Ramasamy Uthurusamy (editors), Proceedings of the 19th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, KDD'13, August 11–14, 2013, Chicago, Illinois, USA. ACM 2013, ISBN 978-1-4503-2174-7.


Table of Contents

General Chairs' Welcome
Robert L. Grossman (University of Chicago)
Ramasamy Uthurusamy (General Motors Corp (retired))

Organization List

Senior Program Committee

Research Track Program Chairs' Welcome
Inderjit S. Dhillon (The University of Texas at Austin)
Yehuda Koren (Google)

Research Track Program Committee

Industry Government Track Program Committee

Industry Practice Expo Chairs' Welcome
Rajesh Parekh (Groupon)
Paul Bradley (MethodCare, Inc.)

Demo Track Program Committee

Research Track External Reviewers

Industry Government Track Program Chairs' Welcome
Rayid Ghani (University of Chicago)
Ted E. Senator (SAIC)

Sponsors & Supporters

Author Index


Keynote Session 1

Time Series and Spatial Data

Keynote Session 2

Unsupervised and Topic Learning

Keynote Session 3

Social and Information Networks

Keynote Session 4

Graph Mining and Sampling

Document and Topic Models

Rule and Pattern Mining

Social Media

Web Mining

Big Data Frameworks

Best Paper Session

Graph Mining

Research Poster Session

Classification

Industry Practice Expo Invited Presentations

Healthcare and Bioinformatics

Industry Government Track

Recommender Systems

Industry Government -- Deployed Presentations

Scalable Methods for Big Data

Industry Government -- Discovery Presentations

Temporal/Social Influence

Industry Government -- Emerging Presentations

Sparse Learning

Panel

Graph Clustering

Demonstrations

Diffusion in Social Networks

Tutorials

Keynote Session 1

Scale-Out Beyond Map-Reduce (Page 1)
Raghu Ramakrishnan (Microsoft)
Team Members CISL (Microsoft)

Keynote Session 2

The Online Revolution: Education for Everyone (Page 2)
Andrew Ng (Stanford University and Coursera)
Daphne Koller (Stanford University and Coursera)

Keynote Session 3

Optimization in Learning and Data Analysis (Page 3)
Stephen J. Wright (University of Wisconsin-Madison)

Keynote Session 4

Predicting the Present with Search Engine Data (Page 4)
Hal Varian (Google Inc.)

Document and Topic Models

One Theme in All Views: Modeling Consensus Topics in Multiple Contexts (Page 5)
Jian Tang (Peking University)
Ming Zhang (Peking University)
Qiaozhu Mei (University of Michigan)

Representing Documents Through Their Readers (Page 14)
Khalid El-Arini (Facebook)
Min Xu (Carnegie Mellon University)
Emily B. Fox (University of Washington)
Carlos Guestrin (University of Washington)

Text-Based Measures of Document Diversity (Page 23)
Kevin Bache (University of California, Irvine)
David Newman (University of California, Irvine)
Padhraic Smyth (University of California, Irvine)

Diversity Maximization Under Matroid Constraints (Page 32)
Zeinab Abbassi (Columbia University)
Vahab S. Mirrokni (Google Research)
Mayur Thakur (Google)

Social Media

Connecting Users Across Social Media Sites: A Behavioral-Modeling Approach (Page 41)
Reza Zafarani (Arizona State University)
Huan Liu (Arizona State University)

Automatic Selection of Social Media Responses to News (Page 50)
Tadej Štajner (Jožef Stefan Institute)
Bart Thomee (Yahoo! Research)
Ana-Maria Popescu (Research Consultant)
Marco Pennacchiotti (eBay, Inc.)
Alejandro Jaimes (Yahoo! Research)

Estimating Sharer Reputation via Social Data Calibration (Page 59)
Jaewon Yang (Stanford University)
Bee-Chung Chen (LinkedIn)
Deepak Agarwal (LinkedIn)

Linking Named Entities in Tweets with Knowledge Base via User Interest Modeling (Page 68)
Wei Shen (Tsinghua University)
Jianyong Wang (Tsinghua University)
Ping Luo (HP Labs China)
Min Wang (Google Research)

Big Data Frameworks

TurboGraph: A Fast Parallel Graph Engine Handling Billion-Scale Graphs in a Single PC (Page 77)
Wook-Shin Han (POSTECH)
Sangyeon Lee (POSTECH)
Kyungyeol Park (POSTECH)
Jeong-Hoon Lee (POSTECH)
Min-Soo Kim (DGIST)
Jinha Kim (POSTECH)
Hwanjo Yu (POSTECH)

Beyond Myopic Inference in Big Data Pipelines (Page 86)
Karthik Raman (Cornell University)
Adith Swaminathan (Cornell University)
Johannes Gehrke (Cornell University)
Thorsten Joachims (Cornell University)

Big Data Analytics with Small Footprint: Squaring the Cloud (Page 95)
John Canny (University of California, Berkeley)
Huasha Zhao (University of California, Berkeley)

Graph Mining

Denser Than the Densest Subgraph: Extracting Optimal Quasi-Cliques with Quality Guarantees (Page 104)
Charalampos E.A.P.A Tsourakakis (Carnegie Mellon University)
Francesco Bonchi (Yahoo! Research)
Aristides Gionis (Aalto University)
Francesco Gullo (Yahoo! Research)
Maria Tsiarli (University of Pittsburgh)

Guided Learning for Role Discovery (GLRD): Framework, Algorithms, and Applications (Page 113)
Sean Gilpin (University of California, Davis)
Tina Eliassi-Rad (Rutgers University)
Ian Davidson (University of California, Davis)

Redundancy-Aware Maximal Cliques (Page 122)
Jia Wang (Chinese University of Hong Kong)
James Cheng (Chinese University of Hong Kong)
Ada Wai-Chee Fu (Chinese University of Hong Kong)

Selective Sampling on Graphs for Classification (Page 131)
Quanquan Gu (University of Illinois at Urbana-Champaign)
Charu Aggarwal (IBM T.J. Watson Research Center)
Jialu Liu (University of Illinois at Urbana-Champaign)
Jiawei Han (University of Illinois at Urbana-Champaign)

Classification

Density-Based Logistic Regression (Page 140)
Wenlin Chen (Washington University in St. Louis)
Yixin Chen (Washington University in St. Louis)
Yi Mao (Xidian University)
Baolong Guo (Xidian University)

MI2LS: Multi-Instance Learning from Multiple Information Sources (Page 149)
Dan Zhang (Facebook Incorporation)
Jingrui He (Stevens Institute of Technology)
Richard Lawrence (IBM T.J. Watson Research Center)

Querying Discriminative and Representative Samples for Batch Mode Active Learning (Page 158)
Zheng Wang (Arizona State University)
Jieping Ye (Arizona State University)

SVMpAUC^tight: A New Support Vector Method for Optimizing Partial AUC Based on a Tight Convex Upper Bound (Page 167)
Harikrishna Narasimhan (Indian Institute of Science)
Shivani Agarwal (Indian Institute of Science)

Healthcare and Bioinformatics

Succinct Interval-Splitting Tree for Scalable Similarity Search of Compound-Protein Pairs with Property Constraints (Page 176)
Yasuo Tabei (Japan Science and Technology Agency)
Akihiro Kishimoto (IBM Research)
Masaaki Kotera (Kyoto University)
Yoshihiro Yamanishi (Kyushu University)

Multi-Source Learning with Block-Wise Missing Data for Alzheimer'Ss Disease Prediction (Page 185)
Shuo Xiang (Arizona State University)
Lei Yuan (Arizona State University)
Wei Fan (Huawei Noah's Ark Lab)
Yalin Wang (Arizona State University)
Paul M. Thompson (University of California, Los Angeles)
Jieping Ye (Arizona State University)

Network Discovery via Constrained Tensor Analysis of fMRI Data (Page 194)
Ian Davidson (University of California, Davis)
Sean Gilpin (University of California, Davis)
Owen Carmichael (University of California, Davis)
Peter Walker (United States Navy)

Recommender Systems

Learning to Question: Leveraging User Preferences for Shopping Advice (Page 203)
Mahashweta Das (University of Texas at Arlington)
Gianmarco De Francisci Morales (Yahoo! Research)
Aristides Gionis (Aalto University and HIIT)
Ingmar Weber (Qatar Computing Research Institute)

Active Learning and Search on Low-Rank Matrices (Page 212)
Dougal J. Sutherland (Carnegie Mellon University)
Barnabás Póczos (Carnegie Mellon University)
Jeff Schneider (Carnegie Mellon University)

LCARS: A Location-Content-Aware Recommender System (Page 221)
Hongzhi Yin (Peking University)
Yizhou Sun (Northeastern University)
Bin Cui (Peking University)
Zhiting Hu (Peking University)
Ling Chen (University of Technology, Sydney)

Scalable Methods for Big Data

Comparing Apples to Oranges: A Scalable Solution with Heterogeneous Hashing (Page 230)
Mingdong Ou (Tsinghua University)
Peng Cui (Tsinghua University)
Fei Wang (IBM Watson Research Center)
Jun Wang (IBM Watson Research Cente)
Wenwu Zhu (Tsinghua University)
Shiqiang Yang (Tsinghua University)

Fast and Scalable Polynomial Kernels via Explicit Feature Maps (Page 239)
Ninh Pham (IT University of Copenhagen)
Rasmus Pagh (IT University of Copenhagen)

Indexed Block Coordinate Descent for Large-Scale Linear Classification with Limited Memory (Page 248)
Ian E.-H. Yen (National Taiwan University)
Chun-Fu Chang (National Taiwan University)
Ting-Wei Lin (National Taiwan University)
Shan-Wei Lin (National Taiwan University)
Shou-De Lin (National Taiwan University)

Recursive Regularization for Large-Scale Classification with Hierarchical and Graphical Dependencies (Page 257)
Siddharth Gopal (Carnegie Mellon University)
Yiming Yang (Carnegie Mellon University)

Temporal/Social Influence

Discovering Latent Influence in Online Social Activities via Shared Cascade Poisson Processes (Page 266)
Tomoharu Iwata (University of Cambridge)
Amar Shah (University of Cambridge)
Zoubin Ghahramani (University of Cambridge)

STRIP: Stream Learning of Influence Probabilities (Page 275)
Konstantin Kutzkov (IT University of Copenhagen)
Albert Bifet (Yahoo! Research)
Francesco Bonchi (Yahoo! Research)
Aristides Gionis (Aalto University & HIIT)

Fast Structure Learning in Generalized Stochastic Processes with Latent Factors (Page 284)
Mohammad Taha Bahadori (University of Southern California)
Yan Liu (University of Southern California)
Eric P. Xing (Carnegie Mellon University)

Sparse Learning

Robust Sparse Estimation of Multiresponse Regression and Inverse Covariance Matrix via the L2 Distance (Page 293)
Aurélie C. Lozano (IBM T.J. Watson Research Center)
Huijing Jiang (IBM T.J. Watson Research Center)
Xinwei Deng (Virginia Tech)

Exact Sparse Recovery with L0 Projections (Page 302)
Ping Li (Cornell University)
Cun-Hui Zhang (Rutgers University)

Robust Principal Component Analysis via Capped Norms (Page 311)
Qian Sun (Arizona State University)
Shuo Xiang (Arizona State University)
Jieping Ye (Arizona State University)

Graph Clustering

Flexible and Robust Co-Regularized Multi-Domain Graph Clustering (Page 320)
Wei Cheng (University of North Carolina at Chapel Hill)
Xiang Zhang (Case Western Reserve University)
Zhishan Guo (University of North Carolina at Chapel Hill)
Yubao Wu (Case Western Reserve University)
Patrick F. Sullivan (University of North Carolina at Chapel Hill)
Wei Wang (University of California at Los Angeles)

Graph Cluster Randomization: Network Exposure to Multiple Universes (Page 329)
Johan Ugander (Cornell University)
Brian Karrer (Facebook)
Lars Backstrom (Facebook)
Jon Kleinberg (Cornell University)

Social Influence Based Clustering of Heterogeneous Information Networks (Page 338)
Yang Zhou (Georgia Institute of Technology)
Ling Liu (Georgia Institute of Technology)

Diffusion in Social Networks

Confluence: Conformity Influence in Large Social Networks (Page 347)
Jie Tang (Tsinghua University)
Sen Wu (Tsinghua University)
Jimeng Sun (IBM T.J. Watson Research Center)

The Role of Information Diffusion in the Evolution of Social Networks (Page 356)
Lilian Weng (Indiana University Bloomington)
Jacob Ratkiewicz (Google Inc.)
Nicola Perra (Northeastern University)
Bruno Gonçalves (Aix Marseille Université)
Carlos Castillo (Qatar Computing Research Institute)
Francesco Bonchi (Yahoo! Research Barcelona)
Rossano Schifanella (University of Torino)
Filippo Menczer (Indiana University, Bloomington)
Alessandro Flammini (Indiana University, Bloomington)

Information Cascade at Group Scale (Page 401)
Milad Eftekhar (University of Toronto)
Yashar Ganjali (University of Toronto)
Nick Koudas (University of Toronto)

Extracting Social Events for Learning Better Information Diffusion Models (Page 365)
Shuyang Lin (University of Illinois at Chicago)
Fengjiao Wang (University of Illinois at Chicago)
Qingbo Hu (University of Illinois at Chicago)
Philip S. Yu (University of Illinois at Chicago)

Time Series and Spatial Data

Model Selection in Markovian Processes (Page 374)
Assaf Hallak (Technion)
Dotan Di-Castro (Technion)
Shie Mannor (Technion)

DTW-D: Time Series Semi-Supervised Learning from a Single Example (Page 383)
Yanping Chen (University of California, Riverside)
Bing Hu (University of California, Riverside)
Eamonn Keogh (University of California, Riverside)
Gustavo E.A.P.A Batista (Universidade de São Paulo - USP)

Model-Based Kernel for Efficient Time Series Analysis (Page 392)
Huanhuan Chen (University of Science and Technology of China & University of Birmingham)
Fengzhen Tang (University of Birmingham)
Peter Tino (University of Birmingham)
Xin Yao (University of Birmingham)

Mining Lines in the Sand: On Trajectory Discovery from Untrustworthy Data in Cyber-Physical System (Page 410)
Lu-An Tang (University of Illinois at Urbana-Champaign)
Xiao Yu (University of Illinois at Urbana-Champaign)
Quanquan Gu (University of Illinois at Urbana-Champaign)
Jiawei Han (University of Illinois at Urbana-Champaign)
Alice Leung (BBN Technology)
Thomas La Porta (The Pennsylvania State University)

Unsupervised and Topic Learning

A General Bootstrap Performance Diagnostic (Page 419)
Ariel Kleiner (University of California, Berkeley)
Ameet Talwalkar (University of California, Berkeley)
Sameer Agarwal (University of California, Berkeley)
Ion Stoica (University of California, Berkeley)
Michael I. Jordan (University of California, Berkeley)

Subsampling for Efficient and Effective Unsupervised Outlier Detection Ensembles (Page 428)
Arthur Zimek (University of Alberta)
Matthew Gaudet (University of Alberta)
Ricardo J. G. B. Campello (University of Alberta)
Jörg Sander (University of Alberta)

A Phrase Mining Framework for Recursive Construction of a Topical Hierarchy (Page 437)
Chi Wang (University of Illinois at Urbana-Champaign)
Marina Danilevsky (University of Illinois at Urbana-Champaign)
Nihit Desai (University of Illinois at Urbana-Champaign)
Yinan Zhang (University of Illinois at Urbana-Champaign)
Phuong Nguyen (University of Illinois at Urbana-Champaign)
Thrivikrama Taula (University of Illinois at Urbana-Champaign)
Jiawei Han (University of Illinois at Urbana-Champaign)

Stochastic Collapsed Variational Bayesian Inference for Latent Dirichlet Allocation (Page 446)
James Foulds (University of California, Irvine)
Levi Boyles (University of California, Irvine)
Christopher DuBois (University of California, Irvine)
Padhraic Smyth (University of California, Irvine)
Max Welling (University of Amsterdam)

Social and Information Networks

WiseMarket: A New Paradigm for Managing Wisdom of Online Social Users (Page 455)
Caleb Chen Cao (The Hong Kong University of Science and Technology)
Yongxin Tong (The Hong Kong University of Science and Technology)
Lei Chen (The Hong Kong University of Science and Technology)
H. V. Jagadish (University of Michigan)

Multi-Label Relational Neighbor Classification Using Social Context Features (Page 464)
Xi Wang (University of Central Florida)
Gita Sukthankar (University of Central Florida)

Scalable Text and Link Analysis with Mixed-Topic Link Models (Page 473)
Yaojia Zhu (University of New Mexico)
Xiaoran Yan (University of New Mexico)
Lise Getoor (University of Maryland)
Cristopher Moore (Santa Fe Institute)

Collaborative Boosting for Activity Classification in Microblogs (Page 482)
Yangqiu Song (Hong Kong University of Science and Technology)
Zhengdong Lu (Noah's Ark Lab, Huawei)
Cane Wing-ki Leung (Noah's Ark Lab, Huawei)
Qiang Yang (Noah's Ark Lab, Huawei)

Graph Mining and Sampling

Trace Complexity of Network Inference (Page 491)
Bruno Abrahao (Cornell University)
Flavio Chierichetti (Sapienza University)
Robert Kleinberg (Cornell University)
Alessandro Panconesi (Sapienza University)

Debiasing Social Wisdom (Page 500)
Abhimanyu Das (Microsoft Research)
Sreenivas Gollapudi (Microsoft Research)
Rina Panigrahy (Microsoft Research)
Mahyar Salek (Microsoft Research)

Mining Discriminative Subgraphs From Global-State Networks (Page 509)
Sayan Ranu (IBM Research)
Minh Hoang (University of California)
Ambuj Singh (University of California)

Approximate Graph Mining with Label Costs (Page 518)
Pranay Anchuri (RPI)
Mohammed J. Zaki (RPI)
Omer Barkol (HP Labs)
Shahar Golan (HP Labs)
Moshe Shamy (HP Software)

Rule and Pattern Mining

Summarizing Probabilistic Frequent Patterns: A Fast Approach (Page 527)
Chunyang Liu (University of Technology, Sydney)
Ling Chen (University of Technology, Sydney)
Chengqi Zhang (University of Technology, Sydney)

Mining High Utility Episodes in Complex Event Sequences (Page 536)
Cheng-Wei Wu (National Cheng Kung University)
Yu-Feng Lin (National Cheng Kung University)
Philip S. Yu (University of Illinois at Chicago)
Vincent S. Tseng (National Cheng Kung University)

Mining Frequent Graph Patterns with Differential Privacy (Page 545)
Entong Shen (North Carolina State University)
Ting Yu (North Carolina State University)

Web Mining

Statistical Quality Estimation for General Crowdsourcing Tasks (Page 554)
Yukino Baba (The University of Tokyo)
Hisashi Kashima (The University of Tokyo)

Psychological Advertising: Exploring User Psychology for Click Prediction in Sponsored Search (Page 563)
Taifeng Wang (Microsoft Research Asia)
Jiang Bian (Microsoft Research Asia)
Shusen Liu (South China University of Technology)
Yuyu Zhang (Chinese Academy of Sciences)
Tie-Yan Liu (Microsoft Research Asia)

SiGMa: Simple Greedy Matching for Aligning Large Knowledge Bases (Page 572)
Simon Lacoste-Julien (INRIA, Paris)
Konstantina Palla (University of Cambridge)
Alex Davies (University of Cambridge)
Gjergji Kasneci (Microsoft Research)
Thore Graepel (Microsoft Research)
Zoubin Ghahramani (University of Cambridge)

Best Paper Session

Simple and Deterministic Matrix Sketching (Page 581)
Edo Liberty (Yahoo! Labs)

A Space Efficient Streaming Algorithm for Triangle Counting Using the Birthday Paradox (Page 589)
Madhav Jha (The Pennsylvania State University)
C. Seshadhri (Sandia National Laboratories)
Ali Pinar (Sandia National Laboratories)

Complete List of all Best Paper Awards

Research Poster Session

Who, Where, When and What: Discover Spatio-Temporal Topics for Twitter Users (Page 605)
Quan Yuan (Nanyang Technological University)
Gao Cong (Nanyang Technological University)
Zongyang Ma (Nanyang Technological University)
Aixin Sun (Nanyang Technological University)
Nadia Magnenat-Thalmann (Nanyang Technological University)

Multi-Label Classification by Mining Label and Instance Correlations from Heterogeneous Information Networks (Page 614)
Xiangnan Kong (University of Illinois at Chicago)
Bokai Cao (Renmin University of China)
Philip S. Yu (University of Illinois at Chicago & King Abdulaziz University)

Accurate Intelligible Models with Pairwise Interactions (Page 623)
Yin Lou (Cornell University)
Rich Caruana (Microsoft Research)
Johannes Gehrke (Cornell University)
Giles Hooker (Cornell University)

Spotting Opinion Spammers Using Behavioral Footprints (Page 632)
Arjun Mukherjee (University of Illinois at Chicago)
Abhinav Kumar (University of Illinois at Chicago)
Bing Liu (University of Illinois at Chicago)
Junhui Wang (University of Illinois at Chicago)
Meichun Hsu (HP Labs)
Malu Castellanos (HP Labs)
Riddhiman Ghosh (HP Labs)

An Efficient ADMM Algorithm for Multidimensional Anisotropic Total Variation Regularization Problems (Page 641)
Sen Yang (Arizona State University)
Jie Wang (Arizona State University)
Wei Fan (Huawei Noah's Ark Lab)
Xiatian Zhang (Huawei Noah's Ark Lab)
Peter Wonka (Arizona State University)
Jieping Ye (Arizona State University)

Speeding Up Large-Scale Learning with a Social Prior (Page 650)
Deepayan Chakrabarti (Facebook Inc.)
Ralf Herbrich (Amazon Inc.)

FISM: Factored Item Similarity Models for Top-N Recommender Systems (Page 659)
Santosh Kabbur (University of Minnesota)
Xia Ning (NEC Laboratories America)
George Karypis (University of Minnesota)

Nonparametric Hierarchal Bayesian Modeling in Non-Contractual Heterogeneous Survival Data (Page 668)
Shouichi Nagano (Nippon Telegraph and Telephone)
Yusuke Ichikawa (Nippon Telegraph and Telephone)
Noriko Takaya (Nippon Telegraph and Telephone)
Tadasu Uchiyama (Nippon Telegraph and Telephone)
Makoto Abe (The University of Tokyo)

Cross-Task Crowdsourcing (Page 677)
Kaixiang Mo (Hong Kong University of Science and Technology)
Erheng Zhong (Hong Kong University of Science and Technology)
Qiang Yang (Hong Kong University of Science and Technology & Huawei Noah's Ark Lab)

Evaluating the Crowd with Confidence (Page 686)
Manas Joglekar (Stanford University)
Hector Garcia-Molina (Stanford University)
Aditya Parameswaran (Stanford University)

Inferring Social Roles and Statuses in Social Networks (Page 695)
Yuchen Zhao (University of Illinois at Chicago)
Guan Wang (University of Illinois at Chicago)
Philip S. Yu (University of Illinois at Chicago & King Abdulaziz University)
Shaobo Liu (LinkedIn Corp.)
Simon Zhang (LinkedIn Corp.)

Adaptive Collective Routing Using Gaussian Process Dynamic Congestion Models (Page 704)
Siyuan Liu (Carnegie Mellon University)
Yisong Yue (Carnegie Mellon University)
Ramayya Krishnan (Carnegie Mellon University)

Maximizing Acceptance Probability for Active Friending in Online Social Networks (Page 713)
De-Nian Yang (Academia Sinica)
Hui-Ju Hung (Academia Sinica)
Wang-Chien Lee (The Pennsylvania State University)
Wei Chen (Microsoft Research Asia)

Mining Evolutionary Multi-Branch Trees from Text Streams (Page 722)
Xiting Wang (Tsinghua University & Microsoft Research Asia)
Shixia Liu (Microsoft Research Asia)
Yangqiu Song (Hong Kong University of Science and Technology)
Baining Guo (Microsoft Research Asia & Tsinghua University)

Active Search on Graphs (Page 731)
Xuezhi Wang (Carnegie Mellon University)
Roman Garnett (Carnegie Mellon University)
Jeff Schneider (Carnegie Mellon University)

Fast Rank-2 Nonnegative Matrix Factorization for Hierarchical Document Clustering (Page 739)
Da Kuang (Georgia Institute of Technology)
Haesun Park (Georgia Institute of Technology)

A "Semi-Lazy" Approach to Probabilistic Path Prediction in Dynamic Environments (Page 748)
Jingbo Zhou (National University of Singapore)
Anthony K. H. Tung (National University of Singapore)
Wei Wu (Institute for Infocomm Research, A*STAR)
Wee Siong Ng (Institute for Infocomm Research, A*STAR)

Optimizing Parallel Belief Propagation in Junction Trees Using Regression (Page 757)
Lu Zheng (Carnegie Mellon University)
Ole Mengshoel (Carnegie Mellon University)

Multi-Source Deep Learning for Information Trustworthiness Estimation (Page 766)
Liang Ge (The State University of New York at Buffalo)
Jing Gao (The State University of New York at Buffalo)
Xiaoyi Li (The State University of New York at Buffalo)
Aidong Zhang (The State University of New York at Buffalo)

Unsupervised Link Prediction Using Aggregative Statistics on Heterogeneous Social Networks (Page 775)
Tsung-Ting Kuo (National Taiwan University)
Rui Yan (Peking University)
Yu-Yang Huang (National Taiwan University)
Perng-Hwa Kung (National Taiwan University)
Shou-De Lin (National Taiwan University)

Link Prediction with Social Vector Clocks (Page 784)
Conrad Lee (University College Dublin)
Bobo Nick (University of Konstanz)
Ulrik Brandes (University of Konstanz)
Pádraig Cunningham (University College Dublin)

Geo-Spotting: Mining Online Location-Based Services for Optimal Retail Store Placement (Page 793)
Dmytro Karamshuk (IMT Institute for Advanced Studies)
Anastasios Noulas (University of Cambridge)
Salvatore Scellato (University of Cambridge)
Vincenzo Nicosia (Queen Mary University of London)
Cecilia Mascolo (University of Cambridge)

Location-Aware Publish/Subscribe (Page 802)
Guoliang Li (Tsinghua University)
Yang Wang (Coordination Center of China)
Ting Wang (Tsinghua University)
Jianhua Feng (Tsinghua University)

Quadratic Optimization to Identify Highly Heritable Quantitative Traits from Complex Phenotypic Features (Page 811)
Jiangwen Sun (University of Connecticut)
Jinbo Bi (University of Connecticut)
Henry R. Kranzler (University of Pennsylvania)

Repetition-Aware Content Placement in Navigational Networks (Page 820)
Dóra Erdös (Boston University)
Vatche Ishakian (Raytheon BBN Technologies)
Azer Bestavros (Boston University)
Evimaria Terzi (Boston University)

Scalable All-Pairs Similarity Search in Metric Spaces (Page 829)
Ye Wang (The Ohio State University)
Ahmed Metwally (Google Inc.)
Srinivasan Parthasarathy (The Ohio State University)

Massively Parallel Expectation Maximization Using Graphics Processing Units (Page 838)
Muzaffer Can Altinigneli (University of Munich)
Claudia Plant (Helmholtz Zentrum München Technische Universität)
Christian Böhm (University of Munich)

Auto-WEKA: Combined Selection and Hyperparameter Optimization of Classification Algorithms (Page 847)
Chris Thornton (University of British Columbia)
Frank Hutter (University of British Columbia)
Holger H. Hoos (University of British Columbia)
Kevin Leyton-Brown (University of British Columbia)

Direct Optimization of Ranking Measures for Learning to Rank Models (Page 856)
Ming Tan (Wright State University)
Tian Xia (Wright State University)
Lily Guo (Wright State University)
Shaojun Wang (Wright State University)

Multi-Space Probabilistic Sequence Modeling (Page 865)
Shuo Chen (Cornell University)
Jiexun Xu (Cornell University)
Thorsten Joachims (Cornell University)

Towards Never-Ending Learning from Time Series Streams (Page 874)
Yuan Hao (University of California, Riverside)
Yanping Chen (University of California, Riverside)
Jesin Zakaria (University of California, Riverside)
Bing Hu (University of California, Riverside)
Thanawin Rakthanmanon (Kasetsart University)
Eamonn Keogh (University of California, Riverside)

Constrained Stochastic Gradient Descent for Large-Scale Least Squares Problem (Page 883)
Yang Mu (University of Massachusetts Boston)
Wei Ding (University of Massachusetts Boston)
Tianyi Zhou (University of Technology Sydney)
Dacheng Tao (University of Technology Sydney)

Making Recommendations from Multiple Domains (Page 892)
Wei Chen (National University of Singapore)
Wynne Hsu (National University of Singapore)
Mong Li Lee (National University of Singapore)

Cascading Outbreak Prediction in Networks: A Data-Driven Approach (Page 901)
Peng Cui (Tsinghua University)
Shifei Jin (Tsinghua University)
Linyun Yu (Tsinghua University)
Fei Wang (IBM T.J. Watson Research Center)
Wenwu Zhu (Tsinghua University)
Shiqiang Yang (Tsinghua University)

Combining Latent Factor Model with Location Features for Event-Based Group Recommendation (Page 910)
Wei Zhang (Tsinghua University)
Jianyong Wang (Tsinghua University)
Wei Feng (Tsinghua University)

Cost-Sensitive Online Active Learning with Application to Malicious URL Detection (Page 919)
Peilin Zhao (Nanyang Technological University)
Steven C. H. Hoi (Nanyang Technological University)

The Bang for the Buck: Fair Competitive Viral Marketing From the Host Perspective (Page 928)
Wei Lu (University of British Columbia)
Francesco Bonchi (Yahoo! Research)
Amit Goyal (University of British Columbia)
Laks V.S. Lakshmanan (University of British Columbia)

Modeling the Dynamics of Composite Social Networks (Page 937)
Erheng Zhong (Hong Kong University of Science and Technology)
Wei Fan (Huawei Noah's Ark Lab)
Yin Zhu (Hong Kong University of Science and Technology)
Qiang Yang (Hong Kong University of Science and Technology & Huawei Noah's Ark Lab)

A Time-Dependent Enhanced Support Vector Machine for Time Series Regression (Page 946)
Goce Ristanoski (The University of Melbourne)
Wei Liu (The University of Melbourne)
James Bailey (The University of Melbourne)

A New Collaborative Filtering Approach for Increasing the Aggregate Diversity of Recommender Systems (Page 955)
Katja Niemann (Fraunhofer Institute for Applied Information Technology)
Martin Wolpers (Fraunhofer Institute for Applied Information Technology)

Scalable Inference in Max-Margin Topic Models (Page 964)
Jun Zhu (Tsinghua University)
Xun Zheng (Tsinghua University)
Li Zhou (Tsinghua University)
Bo Zhang (Tsinghua University)

A Data-Driven Method for In-game Decision Making in MLB: When to Pull a Starting Pitcher (Page 973)
Gartheeban Ganeshapillai (Massachusetts Institute of Technology)
John Guttag (Massachusetts Institute of Technology)

Exploiting User Clicks for Automatic Seed Set Generation for Entity Matching (Page 980)
Xiao Bai (Yahoo! Research)
Flavio P. Junqueira (Microsoft Research)
Srinivasan H. Sengamedu (Komli Labs)

Silence Is Also Evidence: Interpreting Dwell Time for Recommendation from Psychological Perspective (Page 989)
Peifeng Yin (The Pennsylvania State University)
Ping Luo (Hewlett Packard Labs China)
Wang-Chien Lee (The Pennsylvania State University)
Min Wang (Google Research)

Efficient Single-Source Shortest Path and Distance Queries on Large Graphs (Page 998)
Andy Diwen Zhu (Nanyang Technological University)
Xiaokui Xiao (Nanyang Technological University)
Sibo Wang (Nanyang Technological University)
Wenqing Lin (Nanyang Technological University)

On Community Detection in Real-World Networks and the Importance of Degree Assortativity (Page 1007)
Marek Ciglan (Slovak Academy of Sciences)
Michal Laclavík (Slovak Academy of Sciences)
Kjetil Nørvåg (Norwegian University of Science and Technology)

Trial and Error in Influential Social Networks (Page 1016)
Xiaohui Bei (Nanyang Technological University)
Ning Chen (Nanyang Technological University)
Liyu Dou (Nanyang Technological University)
Xiangru Huang (Shanghai Jiao Tong University)
Ruixin Qiang (Shanghai Jiao Tong University)

Collaborative Matrix Factorization with Multiple Similarities for Predicting Drug-Target Interactions (Page 1025)
Xiaodong Zheng (Fudan University)
Hao Ding (Fudan University)
Hiroshi Mamitsuka (Kyoto University)
Shanfeng Zhu (Fudan University)

FeaFiner: Biomarker Identification from Medical Data through Feature Generalization and Selection (Page 1034)
Jiayu Zhou (Arizona State University)
Zhaosong Lu (Simon Fraser University)
Jimeng Sun (IBM T.J. Watson Research Center)
Lei Yuan (Arizona State University)
Fei Wang (IBM T.J. Watson Research Center)
Jieping Ye (Arizona State University)

Learning Geographical Preferences for Point-of-Interest Recommendation (Page 1043)
Bin Liu (Rutgers University)
Yanjie Fu (Rutgers University)
Zijun Yao (Rutgers University)
Hui Xiong (Rutgers University)

Learning Mixed Kronecker Product Graph Models with Simulated Method of Moments (Page 1052)
Sebastian Moreno (Purdue University)
Jennifer Neville (Purdue University)
Sergey Kirshner (Purdue University)

Measuring Spontaneous Devaluations in User Preferences (Page 1061)
Komal Kapoor (University of Minnesota)
Nisheeth Srivastava (University of Minnesota)
Jaideep Srivastava (University of Minnesota)
Paul Schrater (University of Minnesota)

Mining Evidences for Named Entity Disambiguation (Page 1070)
Yang Li (University of California, Santa Barbara)
Chi Wang (University of Illinois at Urbana-Champaign)
Fangqiu Han (University of California, Santa Barbara)
Jiawei Han (University of Illinois at Urbana-Champaign)
Dan Roth (University of Illinois at Urbana-Champaign)
Xifeng Yan (University of California, Santa Barbara)

Privacy-Preserving Data Exploration in Genome-Wide Association Studies (Page 1079)
Aaron Johnson (U.S. Naval Research Laboratory)
Vitaly Shmatikov (The University of Texas at Austin)

Synthetic Review Spamming and Defense (Page 1088)
Huan Sun (University of California, Santa Barbara)
Alex Morales (University of California, Santa Barbara)
Xifeng Yan (University of California, Santa Barbara)

Information Cartography: Creating Zoomable, Large-Scale Maps of Information (Page 1097)
Dafna Shahaf (Stanford University)
Jaewon Yang (Stanford University)
Caroline Suen (Stanford University)
Jeff Jacobs (Stanford University)
Heidi Wang (Stanford University)
Jure Leskovec (Stanford University)

Restreaming Graph Partitioning: Simple Versatile Algorithms for Advanced Balancing (Page 1106)
Joel Nishimura (Cornell University)
Johan Ugander (Cornell University)

Understanding Evolution of Research Themes: A Probabilistic Generative Model for Citations (Page 1115)
Xiaolong Wang (University of Illinois, Urbana-Champaign)
Chengxiang Zhai (University of Illinois, Urbana-Champaign)
Dan Roth (University of Illinois, Urbana-Champaign)

On the Equivalent of Low-Rank Linear Regressions and Linear Discriminant Analysis Based Regressions (Page 1124)
Xiao Cai (University of Texas at Arlington)
Chris Ding (University of Texas at Arlington)
Feiping Nie (University of Texas at Arlington)
Heng Huang (University of Texas at Arlington)

Industry Practice Expo Invited Presentations

Industry Practice Expo Chairs' Welcome
Rajesh Parekh (Groupon)
Paul Bradley (MethodCare, Inc.)

To Buy or Not to Buy - That Is the Question (Page 1133)
Oren Etzioni (University of Washington)

Mining the Digital Universe of Data to Develop Personalized Cancer Therapies (Page 1134)
Eric Schadt (Mount Sinai School of Medicine)

The Business Impact of Deep Learning (Page 1135)
Jeremy Howard (Kaggle)

Adaptive Adversaries: Building Systems to Fight Fraud and Cyber Intruders (Page 1136)
Ari Gesher (Palantir)

Targeting and Influencing at Scale: From Presidential Elections to Social Good (Page 1137)
Rayid Ghani (University of Chicago & Edgeflip)

Hadoop: A View from the Trenches (Page 1138)
Milind Bhandarkar (Pivotal)

Cyber Security - How Visual Analytics Unlock Insight (Page 1139)
Raffael Marty (Pixlcloud)

Using "Big Data" to Solve "Small Data" Problems (Page 1140)
Chris Neumann (Datahero)

Industy Government Track

Industry Government Program Chairs' Welcome
Rayid Ghani (University of Chicago)
Ted E. Senator (SAIC)

Industry Government -- Deployed Presentations

Financing Lead Triggers: Empowering Sales Reps Through Knowledge Discovery and Fusion (Page 1141)
Kareem S. Aggour (GE Global Research)
Bethany Hoogs (GE Global Research)

Query Clustering Based on Bid Landscape for Sponsored Search Auction Optimization (Page 1150)
Ye Chen (Microsoft Corporation)
Weiguo Liu (Microsoft Corporation)
Jeonghee Yi (Microsoft Corporation)
Anton Schwaighofer (Microsoft Corporation)
Tak W. Yan (Microsoft Corporation)

Analysis of Advanced Meter Infrastructure Data of Water Consumption in Apartment Buildings (Page 1159)
Einat Kermany (IBM Research - Haifa)
Hanna Mazzawi (IBM Research - Haifa)
Dorit Baras (IBM Research - Haifa)
Yehuda Naveh (IBM Research - Haifa)
Hagai Michaelis (Arad Technologies)

Online Controlled Experiments at Large Scale (Page 1168)
Ron Kohavi (Microsoft)
Alex Deng (Microsoft)
Brian Frasca (Microsoft)
Toby Walker (Microsoft)
Ya Xu (Microsoft)
Nils Pohlmann (Microsoft)

iHR: An Online Recruiting System for Xiamen Talent Service Center (Page 1177)
Wenxing Hong (Xiamen University)
Lei Li (Florida International University)
Tao Li (Florida International University)
Wenfu Pan (Xiamen Talent Service Center)

Dynamic Memory Allocation Policies for Postings in Real-Time Twitter Search (Page 1186)
Nima Asadi (University of Maryland)
Jimmy Lin (University of Maryland)
Michael Busch (Twitter)

A Unified Search Federation System Based on Online User Feedback (Page 1195)
Luo Jie (Yahoo! Labs)
Sudarshan Lamkhede (Yahoo! Labs)
Rochit Sapra (Yahoo! Search)
Evans Hsu (Yahoo! Taiwan)
Helen Song (Yahoo! Search)
Yi Chang (Yahoo! Labs)

Amplifying the Voice of Youth in Africa via Text Analytics (Page 1204)
Prem Melville (IBM Research)
Vijil Chenthamarakshan (IBM Research)
Richard D. Lawrence (IBM Rsearch)
James Powell (UNICEF Uganda)
Moses Mugisha (UNICEF Uganda)
Sharad Sapra (UNICEF Uganda)
Rajesh Anandan (US Fund for UNICEF)
Solomon Assefa (IBM Research)

Scalable Supervised Dimensionality Reduction Using Clustering (Page 1213)
Troy Raeder (m6d Research)
Claudia Perlich (m6d Research)
Brian Dalessandro (m6d Research)
Ori Stitelman (m6d Research)
Foster Provost (New York University & m6d Research)

Ad Click Prediction: A View from the Trenches (Page 1222)
H. Brendan McMahan (Google, Inc.)
Gary Holt (Google, Inc.)
D. Sculley (Google, Inc.)
Michael Young (Google, Inc.)
Dietmar Ebner (Google, Inc.)
Julian Grady (Google, Inc.)
Lan Nie (Google, Inc.)
Todd Phillips (Google, Inc.)
Eugene Davydov (Google, Inc.)
Daniel Golovin (Google, Inc.)
Sharat Chikkerur (Google, Inc.)
Dan Liu (Google, Inc.)
Martin Wattenberg (Google, Inc.)
Arnar Mar Hrafnkelsson (Google, Inc.)
Tom Boulos (Google, Inc.)
Jeremy Kubica (Google, Inc.)

Modeling and Probabilistic Reasoning of Population Evacuation During Large-Scale Disaster (Page 1231)
Xuan Song (The University of Tokyo)
Quanshi Zhang (The University of Tokyo)
Yoshihide Sekimoto (The University of Tokyo)
Teerayut Horanont (The University of Tokyo)
Satoshi Ueyama (The University of Tokyo)
Ryosuke Shibasaki (The University of Tokyo)

Using Co-Visitation Networks for Detecting Large Scale Online Display Advertising Exchange Fraud (Page 1240)
Ori Stitelman (m6d Research)
Claudia Perlich (m6d Research)
Brian Dalessandro (m6d Research)
Rod Hook (m6d Research)
Troy Raeder (m6d Research)
Foster Provost (NYU/Stern School and m6d Research)

An Integrated Framework for Optimizing Automatic Monitoring Systems in Large IT Infrastructures (Page 1249)
Liang Tang (Florida International University)
Tao Li (Florida International University)
Larisa Shwartz (IBM Watson Research Center)
Florian Pinel (IBM Watson Research Center)
Genady Ya. Grabarnik (St. John's University)

Improving Quality Control by Early Prediction of Manufacturing Outcomes (Page 1258)
Sholom M. Weiss (IBM T.J. Watson Research Center)
Amit Dhurandhar (IBM T.J. Watson Research Center)
Robert J. Baseman (IBM T.J. Watson Research Center)

Industry Government -- Discovery Presentations

A Data Mining Driven Risk Profiling Method for Road Asset Management (Page 1267)
Daniel Emerson (Queensland University of Technology)
Justin Z. Weligamage (Roadway Engineering Consultant)
Richi Nayak (Queensland University of Technology)

Why People Hate Your App - Making Sense of User Feedback in a Mobile App Store (Page 1276)
Bin Fu (Carnegie Mellon University)
Jialiu Lin (Carnegie Mellon University)
Lei Li (University of California, Berkeley)
Christos Faloutsos (Carnegie Mellon University)
Jason Hong (Carnegie Mellon University)
Norman Sadeh (Carnegie Mellon University)

Towards Long-Lead Forecasting of Extreme Flood Events: A Data Mining Framework for Precipitation Cluster Precursors Identification (Page 1285)
Dawei Wang (University of Massachusets, Boston)
Wei Ding (University of Massachusets, Boston)
Kui Yu (Hefei University of Technology)
Xindong Wu (University of Vermont)
Ping Chen (University of Houston-Downtown)
David L. Small (Tufts University)
Shafiqul Islam (Tufts University)

Predictive Model Performance: Offline and Online Evaluations (Page 1294)
Jeonghee Yi (Microsoft Corporation)
Ye Chen (Microsoft Corporation)
Jie Li (Microsoft Corporation)
Swaraj Sett (Microsoft Corporation)
Tak W. Yan (Microsoft Corporation)

Industry Government -- Emerging Presentations

Uncertainty in Online Experiments with Dependent Data: An Evaluation of Bootstrap Methods (Page 1303)
Eytan Bakshy (Facebook)
Dean Eckles (Facebook)

Knowledge Discovery from Massive Healthcare Claims Data (Page 1312)
Varun Chandola (Oak Ridge National Laboratory)
Sreenivas R. Sukumar (Oak Ridge National Laboratory)
Jack Schryver (Oak Ridge National Laboratory)

Palette Power: Enabling Visual Search Through Colors (Page 1321)
Anurag Bhardwaj (eBay Research Labs)
Atish Das Sarma (eBay Research Labs)
Wei Di (eBay Research Labs)
Raffay Hamid (eBay Research Labs)
Robinson Piramuthu (eBay Research Labs)
Neel Sundaresan (eBay Research Labs)

Heat Pump Detection from Coarse Grained Smart Meter Data with Positive and Unlabeled Learning (Page 1330)
Hongliang Fei (IBM T.J. Watson Research Center)
Younghun Kim (IBM T.J. Watson Research Center)
Sambit Sahu (IBM T.J. Watson Research Center)
Milind Naphade (IBM T.J. Watson Research Center)
Sanjay K. Mamidipalli (IBM Global Business Service)
John Hutchinson (IBM Global Business Service)

Empirical Bayes Model to Combine Signals of Adverse Drug Reactions (Page 1339)
Rave Harpaz (Stanford University)
William DuMouchel (Oracle Health Sciences & Observational Medical Outcomes Partnership)
Paea LePendu (Stanford University)
Nigam H. Shah (Stanford University)

Efficiently Rewriting Large Multimedia Application Execution Traces with Few Event Sequences (Page 1348)
Christiane Kamdem Kengne (University of Grenoble & University of Yasunde I)
Leon Constantin Fopa (University of Grenoble & University of Yasunde I)
Alexandre Termier (University of Grenoble)
Noha Ibrahim (University of Grenoble)
Marie-Christine Rousset (University of Grenoble)
Takashi Washio (Osaka University)
Miguel Santana (STMicroelectronics)

Discriminant Malware Distance Learning on Structural Information for Automated Malware Classification (Page 1357)
Deguang Kong (University of Texas at Arlington)
Guanhua Yan (Los Alamos National Laboratory)

Assessing Team Strategy Using Spatiotemporal Data (Page 1366)
Patrick Lucey (Disney Research Pittsburgh)
Dean Oliver (ESPN)
Peter Carr (Disney Research Pittsburgh)
Joe Roth (Disney Research Pittsburgh)
Iain Matthews (Disney Research Pittsburgh)

Exploratory Analysis of Highly Heterogeneous Document Collections (Page 1375)
Arun S. Maiya (Institute for Defense Analyses)
John P. Thompson (Institute for Defense Analyses)
Francisco Loaiza-Lemos (Institute for Defense Analyses)
Robert M. Rolfe (Institute for Defense Analyses)

Experience from Hosting a Corporate Prediction Market: Benefits Beyond the Forecasts (Page 1384)
Thomas A. Montgomery (Ford Motor Company)
Paul M. Stieg (Ford Motor Company)
Michael J. Cavaretta (Ford Motor Company)
Paul E. Moraal (Ford Motor Company)

Detecting Insider Threats in a Real Corporate Database of Computer Usage Activity (Page 1393)
Ted E. Senator (SAIC)
Henry G. Goldberg (SAIC)
Alex Memory (SAIC)
William T. Young (SAIC)
Brad Rees (SAIC)
Robert Pierce (SAIC)
Daniel Huang (SAIC)
Matthew Reardon (SAIC)
David A. Bader (Georgia Institute of Technology)
Edmond Chow (Georgia Institute of Technology)
Irfan Essa (Georgia Institute of Technology)
Joshua Jones (Georgia Institute of Technology)
Vinay Bettadapura (Georgia Institute of Technology)
Duen Horng Chau (Georgia Institute of Technology)
Oded Green (Georgia Institute of Technology)
Oguz Kaya (Georgia Institute of Technology)
Anita Zakrzewska (Georgia Institute of Technology)
Erica Briscoe (Georgia Institute of Technology)
Rudolph L. Mappus IV (Georgia Institute of Technology)
Robert McColl (Georgia Institute of Technology)
Lora Weiss (Georgia Institute of Technology)
Thomas G. Dietterich (Oregon State University)
Alan Fern (Oregon State University)
Weng-Keen Wong (Oregon State University)
Shubhomoy Das (Oregon State University)
Andrew Emmott (Oregon State University)
Jed Irvine (Oregon State University)
Jay-Yoon Lee (Carnegie Mellon University)
Danai Koutra (Carnegie Mellon University)
Christos Faloutsos (Carnegie Mellon University)
Daniel Corkill (University of Massachusetts)
Lisa Friedland (University of Massachusetts)
Amanda Gentzel (University of Massachusetts)
David Jensen (University of Massachusetts)

Mining for Geographically Disperse Communities in Social Networks by Leveraging Distance Modularity (Page 1402)
Paulo Shakarian (U.S. Military Academy)
Patrick Roos (University of Maryland)
Devon Callahan (U.S. Military Academy)
Cory Kirk (U.S. Military Academy)

An Integrated Framework for Suicide Risk Prediction (Page 1410)
Truyen Tran (Deakin University and Curtin University)
Dinh Phung (Deakin University)
Wei Luo (Deakin University)
Richard Harvey (Barwon Health)
Michael Berk (Deakin University)
Svetha Venkatesh (Deakin University)

Gaussian Multiple Instance Learning Approach for Mapping the Slums of the World Using Very High Resolution Imagery (Page 1419)
Ranga Raju Vatsavai (Oak Ridge National Laboratory)

A Privacy Preserving Framework for Managing Vehicle Data in Road Pricing Systems (Page 1427)
Huayu Wu (Institute for Infocomm Research)
Wee Siong Ng (Institute for Infocomm Research)
Kian-Lee Tan (National University of Singapore)
Wei Wu (Institute for Infocomm Research)
Shili Xiang (Institute for Infocomm Research)
Mingqiang Xue (Institute for Infocomm Research)

U-Air: When Urban Air Quality Inference Meets Big Data (Page 1436)
Yu Zheng (Microsoft Research Asia)
Furui Liu (Microsoft Research Asia)
Hsun-Ping Hsieh (Microsoft Research Asia)

Panel

Panel: A Data Scientist's Guide to Making Money from Start-Ups (Page 1445)
Foster Provost (New York University)
Geoffrey I. Webb (Monash University)

Demonstrations

LAICOS: An Open Source Platform for Personalized Social Web Search (Page 1446)
Mohamed Reda Bouadjenek (PRiSM Laboratory, Versailles University)
Hakim Hacid (Sidetrade, France)
Mokrane Bouzeghoub (PRiSM Laboratory, Versailles University)

JobMiner: A Real-Time System for Mining Job-Related Patterns from Social Media (Page 1450)
Yu Cheng (Northwestern University)
Yusheng Xie (Northwestern University)
Zhengzhang Chen (Northwestern University)
Ankit Agrawal (Northwestern University)
Alok Choudhary (Northwestern University)
Songtao Guo (LinkedIn)

Inferring Distant-Time Location in Low-Sampling-Rate Trajectories (Page 1454)
Meng-Fen Chiang (National Chiao Tung University)
Yung-Hsiang Lin (National Chiao Tung University)
Wen-Chih Peng (National Chiao Tung University)
Philip S. Yu (University of Illinois at Chicago)

AMETHYST: A System for Mining and Exploring Topical Hierarchies of Heterogeneous Data (Page 1458)
Marina Danilevsky (University of Illinois at Urbana-Champaign)
Chi Wang (University of Illinois at Urbana-Champaign)
Fangbo Tao (University of Illinois at Urbana-Champaign)
Son Nguyen (University of Illinois at Urbana-Champaign)
Gong Chen (University of Illinois at Urbana-Champaign)
Nihit Desai (University of Illinois at Urbana-Champaign)
Lidan Wang (University of Illinois at Urbana-Champaign)
Jiawei Han (University of Illinois at Urbana-Champaign)

A Tool for Collecting Provenance Data in Social Media (Page 1462)
Pritam Gundecha (Arizona State University)
Suhas Ranganath (Arizona State University)
Zhuo Feng (Arizona State University)
Huan Liu (Arizona State University)

STED: Semi-Supervised Targeted-Interest Event Detection in Twitter (Page 1466)
Ting Hua (Virginia Tech)
Feng Chen (Carnegie Mellon University)
Liang Zhao (Virginia Tech)
Chang-Tien Lu (Virginia Tech)
Naren Ramakrishnan (Virginia Tech)

Forex-Foreteller: Currency Trend Modeling Using News Articles (Page 1470)
Fang Jin (Virginia Tech)
Nathan Self (Virginia Tech)
Parang Saraf (Virginia Tech)
Patrick Butler (Virginia Tech)
Wei Wang (Virginia Tech)
Naren Ramakrishnan (Virginia Tech)

Real-Time Disease Surveillance Using Twitter Data: Demonstration on Flu and Cancer (Page 1474)
Kathy Lee (Northwestern University)
Ankit Agrawal (Northwestern University)
Alok Choudhary (Northwestern University)

KeySee: Supporting Keyword Search on Evolving Events in Social Streams (Page 1478)
Pei Lee (University of British Columbia)
Laks V.S. Lakshmanan (University of British Columbia)
Evangelos Milios (Dalhousie University)

Understanding Twitter Data with TweetXplorer (Page 1482)
Fred Morstatter (Arizona State University)
Shamanth Kumar (Arizona State University)
Huan Liu (Arizona State University)
Ross Maciejewski (Arizona State University)

An Online System with End-User Services: Mining Novelty Concepts from TV Broadcast Subtitles (Page 1486)
Mika Rautiainen (CSE, University of Oulu)
Jouni Sarvanko (CSE, University of Oulu)
Arto Heikkinen (CSE, University of Oulu)
Mika Ylianttila (CIE, University of Oulu)
Vassilis Kostakos (CSE, University of Oulu)

When TEDDY Meets GrizzLY: Temporal Dependency Discovery for Triggering Road Deicing Operations (Page 1490)
Céline Robardet (Université de Lyon)
Vasile-Marian Scuturici (Université de Lyon)
Marc Plantevit (Université de Lyon)
Antoine Fraboulet (HiKoB)

EventCube: Multi-Dimensional Search and Mining of Structured and Text Data (Page 1494)
Fangbo Tao (University of Illinois at Urbana-Champaign)
Kin Hou Lei (University of Illinois at Urbana-Champaign)
Jiawei Han (University of Illinois at Urbana-Champaign)
ChengXiang Zhai (University of Illinois at Urbana-Champaign)
Xiao Cheng (University of Illinois at Urbana-Champaign)
Marina Danilevsky (University of Illinois at Urbana-Champaign)
Nihit Desai (University of Illinois at Urbana-Champaign)
Bolin Ding (University of Illinois at Urbana-Champaign)
Jing Ge Ge (University of Illinois at Urbana-Champaign)
Heng Ji (City University of New York)
Rucha Kanade (University of Illinois at Urbana-Champaign)
Anne Kao (Boeing Research & Technology)
Qi Li (City University of New York)
Yanen Li (University of Illinois at Urbana-Champaign)
Cindy Xide Lin (University of Illinois at Urbana-Champaign)
Jialu Liu (University of Illinois at Urbana-Champaign)
Nikunj Oza (NASA)
Ashok Srivastava (NASA)
Rod Tjoelker (Boeing Research & Technology)
Chi Wang (University of Illinois at Urbana-Champaign)
Duo Zhang (University of Illinois at Urbana-Champaign)
Bo Zhao (University of Illinois at Urbana-Champaign)

SEA: A System for Event Analysis on Chinese Tweets (Page 1498)
Yaqiong Wang (Beihang University)
Hongfu Liu (Beihang University)
Hao Lin (Beihang University)
Junjie Wu (Beihang University)
Zhiang Wu (Nanjing University of Finance and Economics)
Jie Cao (Nanjing University of Finance and Economics)

SAE: Social Analytic Engine for Large Networks (Page 1502)
Yang Yang (Tsinghua University)
Jianfei Wang (Tsinghua University)
Yutao Zhang (Tsinghua University)
Wei Chen (Tsinghua University)
Jing Zhang (Tsinghua University)
Honglei Zhuang (Tsinghua University)
Zhilin Yang (Tsinghua University)
Bo Ma (Tsinghua University)
Zhanpeng Fang (Tsinghua University)
Sen Wu (Tsinghua University)
Xiaoxiao Li (Tsinghua University)
Debing Liu (Tsinghua University)
Jie Tang (Tsinghua University)

FIU-Miner: A Fast, Integrated, and User-Friendly System for Data Mining in Distributed Environment (Page 1506)
Chunqiu Zeng (Florida International University)
Yexi Jiang (Florida International University)
Li Zheng (Florida International University)
Jingxuan Li (Florida International University)
Lei Li (Florida International University)
Hongtai Li (Florida International University)
Chao Shen (Florida International University)
Wubai Zhou (Florida International University)
Tao Li (Florida International University)
Bing Duan (ChangHong COC Display Devices Co., Ltd.)
Ming Lei (ChangHong COC Display Devices Co., Ltd.)
Pengnian Wang (ChangHong COC Display Devices Co., Ltd.)

LAFT-Explorer: Inferring, Visualizing and Predicting How Your Social Network Expands (Page 1510)
Jun Zhang (Tsinghua University & Ministry of Education)
Chaokun Wang (Tsinghua University)
Yuanchi Ning (Tsinghua University & Ministry of Education)
Yichi Liu (Tsinghua University & Ministry of Education)
Jianmin Wang (Tsinghua University & Ministry of Education)
Philip S. Yu (University of Illinois at Chicago)

A Transfer Learning Based Framework of Crowd-Selection on Twitter (Page 1514)
Zhou Zhao (The Hong Kong University of Science and Technology)
Da Yan (The Hong Kong University of Science and Technology)
Wilfred Ng (The Hong Kong University of Science and Technology)
Shi Gao (The Hong Kong University of Science and Technology)

Risk-O-Meter: An Intelligent Clinical Risk Calculator (Page 1518)
Kiyana Zolfaghar (University of Washington-Tacoma)
Jayshree Agarwal (University of Washington-Tacoma)
Deepthi Sistla (University of Washington-Tacoma)
Si-Chi Chin (University of Washington-Tacoma)
Senjuti Basu Roy (University of Washington-Tacoma)
Nele Verbiest (Ghent University)

Tutorials

Algorithmic Techniques for Modeling and Mining Large Graphs (AMAzING) (Page 1523)
Alan Frieze (Carnegie Mellon University)
Aristides Gionis (Aalto University)
Charalampos Tsourakakis (Aalto Science Fellow)

Mining Data from Mobile Devices: A Survey of Smart Sensing and Analytics (Page 1524)
Spiros Papadimitriou
Tina Eliassi-Rad (Rutgers University)

Big Data Analytics for Healthcare (Page 1525)
Jimeng Sun (IBM TJ Watson Research Center)
Chandan K. Reddy (Wayne State University)

 

Entity Resolution for Big Data (Page 1527)
Lise Getoor (University of Maryland, College Park)
Ashwin Machanavajjhala (Duke University)

Network Sampling (Page 1528)
Mohammad A. Hasan (Indiana University–Purdue University, Indianapolis)
Jennifer Neville (Purdue University)
Nesreen Ahmed (Purdue University)

The Dataminer’s Guide to Scalable Mixed-Membership and Nonparametric Bayesian Models (Page 1529)
Dr. Amr Ahmed (Google)
Dr. Alex Smola (Carnegie Mellon University)