Main Page

Table of Contents

Author Index

Co-Located Workshops

ACM SIGKDD Membership

Author Index


Abbassi, Zeinab

Diversity Maximization Under Matroid Constraints (Page 32)


Abe, Makoto

Nonparametric Hierarchal Bayesian Modeling in Non-Contractual Heterogeneous Survival Data (Page 668)


Abrahao, Bruno

Trace Complexity of Network Inference (Page 491)


Agarwal, Deepak

Estimating Sharer Reputation via Social Data Calibration (Page 59)


Agarwal, Jayshree

Risk-O-Meter: An Intelligent Clinical Risk Calculator (Page 1518)


Agarwal, Sameer

A General Bootstrap Performance Diagnostic (Page 419)


Agarwal, Shivani

SVMpAUCtight: A New Support Vector Method for Optimizing Partial AUC Based on a Tight Convex Upper Bound (Page 167)


Aggarwal, Charu

Selective Sampling on Graphs for Classification (Page 131)


Aggour, Kareem S.

Financing Lead Triggers: Empowering Sales Reps Through Knowledge Discovery and Fusion (Page 1141)


Agrawal, Ankit

JobMiner: A Real-Time System for Mining Job-Related Patterns from Social Media (Page 1450)

Real-Time Disease Surveillance Using Twitter Data: Demonstration on Flu and Cancer (Page 1474)


Ahmed, Amr

The Dataminer’s Guide to Scalable Mixed-Membership and Nonparametric Bayesian Models (Page 1529)


Altinigneli, Muzaffer Can

Massively Parallel Expectation Maximization Using Graphics Processing Units (Page 838)


Anandan, Rajesh

Amplifying the Voice of Youth in Africa via Text Analytics (Page 1204)


Anchuri, Pranay

Approximate Graph Mining with Label Costs (Page 518)


Asadi, Nima

Dynamic Memory Allocation Policies for Postings in Real-Time Twitter Search (Page 1186)


Assefa, Solomon

Amplifying the Voice of Youth in Africa via Text Analytics (Page 1204)


Baba, Yukino

Statistical Quality Estimation for General Crowdsourcing Tasks (Page 554)


Bache, Kevin

Text-Based Measures of Document Diversity (Page 23)


Backstrom, Lars

Graph Cluster Randomization: Network Exposure to Multiple Universes (Page 329)


Bader, David A.

Detecting Insider Threats in a Real Corporate Database of Computer Usage Activity (Page 1393)


Bahadori, Mohammad Taha

Fast Structure Learning in Generalized Stochastic Processes with Latent Factors (Page 284)


Bai, Xiao

Exploiting User Clicks for Automatic Seed Set Generation for Entity Matching (Page 980)


Bailey, James

A Time-Dependent Enhanced Support Vector Machine for Time Series Regression (Page 946)


Bakshy, Eytan

Uncertainty in Online Experiments with Dependent Data: An Evaluation of Bootstrap Methods (Page 1303)


Baras, Dorit

Analysis of Advanced Meter Infrastructure Data of Water Consumption in Apartment Buildings (Page 1159)


Barkol, Omer

Approximate Graph Mining with Label Costs (Page 518)


Baseman, Robert J.

Improving Quality Control by Early Prediction of Manufacturing Outcomes (Page 1258)


Basu Roy, Senjuti

Risk-O-Meter: An Intelligent Clinical Risk Calculator (Page 1518)


Batista, Gustavo E.A.P.A

DTW-D: Time Series Semi-Supervised Learning from a Single Example (Page 383)


Bei, Xiaohui

Trial and Error in Influential Social Networks (Page 1016)


Berk, Michael

An Integrated Framework for Suicide Risk Prediction (Page 1410)


Bestavros, Azer

Repetition-Aware Content Placement in Navigational Networks (Page 820)


Bettadapura, Vinay

Detecting Insider Threats in a Real Corporate Database of Computer Usage Activity (Page 1393)


Bhandarkar, Milind

Hadoop: A View from the Trenches (Page 1138)


Bhardwaj, Anurag

Palette Power: Enabling Visual Search Through Colors (Page 1321)


Bi, Jinbo

Quadratic Optimization to Identify Highly Heritable Quantitative Traits from Complex Phenotypic Features (Page 811)


Bian, Jiang

Psychological Advertising: Exploring User Psychology for Click Prediction in Sponsored Search (Page 563)


Bifet, Albert

STRIP: Stream Learning of Influence Probabilities (Page 275)


Böhm, Christian

Massively Parallel Expectation Maximization Using Graphics Processing Units (Page 838)


Bonchi, Francesco

Denser Than the Densest Subgraph: Extracting Optimal Quasi-Cliques with Quality Guarantees (Page 104)

STRIP: Stream Learning of Influence Probabilities (Page 275)

The Bang for the Buck: Fair Competitive Viral Marketing From the Host Perspective (Page 928)

The Role of Information Diffusion in the Evolution of Social Networks (Page 356)


Bouadjenek, Mohamed Reda

LAICOS: An Open Source Platform for Personalized Social Web Search (Page 1446)


Boulos, Tom

Ad Click Prediction: A View from the Trenches (Page 1222)


Bouzeghoub, Mokrane

LAICOS: An Open Source Platform for Personalized Social Web Search (Page 1446)


Boyles, Levi

Stochastic Collapsed Variational Bayesian Inference for Latent Dirichlet Allocation (Page 446)


Bradley, Paul

Industry Practice Expo Chairs' Welcome Message


Brandes, Ulrik

Link Prediction with Social Vector Clocks (Page 784)


Briscoe, Erica

Detecting Insider Threats in a Real Corporate Database of Computer Usage Activity (Page 1393)


Busch, Michael

Dynamic Memory Allocation Policies for Postings in Real-Time Twitter Search (Page 1186)


Butler, Patrick

Forex-Foreteller: Currency Trend Modeling Using News Articles (Page 1470)


Cai, Xiao

On the Equivalent of Low-Rank Linear Regressions and Linear Discriminant Analysis Based Regressions (Page 1124)


Callahan, Devon

Mining for Geographically Disperse Communities in Social Networks by Leveraging Distance Modularity (Page 1402)


Campello, Ricardo J. G. B.

Subsampling for Efficient and Effective Unsupervised Outlier Detection Ensembles (Page 428)


Canny, John

Big Data Analytics with Small Footprint: Squaring the Cloud (Page 95)


Cao, Bokai

Multi-Label Classification by Mining Label and Instance Correlations from Heterogeneous Information Networks (Page 614)


Cao, Caleb Chen

WiseMarket: A New Paradigm for Managing Wisdom of Online Social Users (Page 455)


Cao, Jie

SEA: A System for Event Analysis on Chinese Tweets (Page 1498)


Carmichael, Owen

Network Discovery via Constrained Tensor Analysis of fMRI Data (Page 194)


Carr, Peter

Assessing Team Strategy Using Spatiotemporal Data (Page 1366)


Caruana, Rich

Accurate Intelligible Models with Pairwise Interactions (Page 623)


Castellanos, Malu

Spotting Opinion Spammers Using Behavioral Footprints (Page 632)


Castillo, Carlos

The Role of Information Diffusion in the Evolution of Social Networks (Page 356)


Cavaretta, Michael J.

Experience from Hosting a Corporate Prediction Market: Benefits Beyond the Forecasts (Page 1384)


Chakrabarti, Deepayan

Speeding Up Large-Scale Learning with a Social Prior (Page 650)


Chandola, Varun

Knowledge Discovery from Massive Healthcare Claims Data (Page 1312)


Chang, Chun-Fu

Indexed Block Coordinate Descent for Large-Scale Linear Classification with Limited Memory (Page 248)


Chang, Yi

A Unified Search Federation System Based on Online User Feedback (Page 1195)


Chau, Duen Horng

Detecting Insider Threats in a Real Corporate Database of Computer Usage Activity (Page 1393)


Chen, Bee-Chung

Estimating Sharer Reputation via Social Data Calibration (Page 59)


Chen, Feng

STED: Semi-Supervised Targeted-Interest Event Detection in Twitter (Page 1466)


Chen, Gong

AMETHYST: A System for Mining and Exploring Topical Hierarchies of Heterogeneous Data (Page 1458)


Chen, Huanhuan

Model-Based Kernel for Efficient Time Series Analysis (Page 392)


Chen, Lei

WiseMarket: A New Paradigm for Managing Wisdom of Online Social Users (Page 455)


Chen, Ling

LCARS: A Location-Content-Aware Recommender System (Page 221)

Summarizing Probabilistic Frequent Patterns: A Fast Approach (Page 527)


Chen, Ning

Trial and Error in Influential Social Networks (Page 1016)


Chen, Ping

Towards Long-Lead Forecasting of Extreme Flood Events: A Data Mining Framework for Precipitation Cluster Precursors Identification (Page 1285)


Chen, Shuo

Multi-Space Probabilistic Sequence Modeling (Page 865)


Chen, Wei

Making Recommendations from Multiple Domains (Page 892)

Maximizing Acceptance Probability for Active Friending in Online Social Networks (Page 713)

SAE: Social Analytic Engine for Large Networks (Page 1502)


Chen, Wenlin

Density-Based Logistic Regression (Page 140)


Chen, Yanping

DTW-D: Time Series Semi-Supervised Learning from a Single Example (Page 383)

Towards Never-Ending Learning from Time Series Streams (Page 874)


Chen, Ye

Predictive Model Performance: Offline and Online Evaluations (Page 1294)

Query Clustering Based on Bid Landscape for Sponsored Search Auction Optimization (Page 1150)


Chen, Yixin

Density-Based Logistic Regression (Page 140)


Chen, Zhengzhang

JobMiner: A Real-Time System for Mining Job-Related Patterns from Social Media (Page 1450)


Cheng, James

Redundancy-Aware Maximal Cliques (Page 122)


Cheng, Wei

Flexible and Robust Co-Regularized Multi-Domain Graph Clustering (Page 320)


Cheng, Xiao

EventCube: Multi-Dimensional Search and Mining of Structured and Text Data (Page 1494)


Cheng, Yu

JobMiner: A Real-Time System for Mining Job-Related Patterns from Social Media (Page 1450)


Chenthamarakshan, Vijil

Amplifying the Voice of Youth in Africa via Text Analytics (Page 1204)


Chiang, Meng-Fen

Inferring Distant-Time Location in Low-Sampling-Rate Trajectories (Page 1454)


Chierichetti, Flavio

Trace Complexity of Network Inference (Page 491)


Chikkerur, Sharat

Ad Click Prediction: A View from the Trenches (Page 1222)


Chin, Si-Chi

Risk-O-Meter: An Intelligent Clinical Risk Calculator (Page 1518)


Choudhary, Alok

JobMiner: A Real-Time System for Mining Job-Related Patterns from Social Media (Page 1450)

Real-Time Disease Surveillance Using Twitter Data: Demonstration on Flu and Cancer (Page 1474)


Chow, Edmond

Detecting Insider Threats in a Real Corporate Database of Computer Usage Activity (Page 1393)


Ciglan, Marek

On Community Detection in Real-World Networks and the Importance of Degree Assortativity (Page 1007)


CISL, Team Members

Scale-Out Beyond Map-Reduce (Page 1)


Cong, Gao

Who, Where, When and What: Discover Spatio-Temporal Topics for Twitter Users (Page 605)


Corkill, Daniel

Detecting Insider Threats in a Real Corporate Database of Computer Usage Activity (Page 1393)


Cui, Bin

LCARS: A Location-Content-Aware Recommender System (Page 221)


Cui, Peng

Cascading Outbreak Prediction in Networks: A Data-Driven Approach (Page 901)

Comparing Apples to Oranges: A Scalable Solution with Heterogeneous Hashing (Page 230)


Cunningham, Pádraig

Link Prediction with Social Vector Clocks (Page 784)


Dalessandro, Brian

Scalable Supervised Dimensionality Reduction Using Clustering (Page 1213)

Using Co-Visitation Networks for Detecting Large Scale Online Display Advertising Exchange Fraud (Page 1240)


Danilevsky, Marina

A Phrase Mining Framework for Recursive Construction of a Topical Hierarchy (Page 437)

AMETHYST: A System for Mining and Exploring Topical Hierarchies of Heterogeneous Data (Page 1458)

EventCube: Multi-Dimensional Search and Mining of Structured and Text Data (Page 1494)


Das, Abhimanyu

Debiasing Social Wisdom (Page 500)


Das, Mahashweta

Learning to Question: Leveraging User Preferences for Shopping Advice (Page 203)


Das, Shubhomoy

Detecting Insider Threats in a Real Corporate Database of Computer Usage Activity (Page 1393)


Das Sarma, Atish

Palette Power: Enabling Visual Search Through Colors (Page 1321)


Davidson, Ian

Guided Learning for Role Discovery (GLRD): Framework, Algorithms, and Applications (Page 113)

Network Discovery via Constrained Tensor Analysis of fMRI Data (Page 194)


Davies, Alex

SiGMa: Simple Greedy Matching for Aligning Large Knowledge Bases (Page 572)


Davydov, Eugene

Ad Click Prediction: A View from the Trenches (Page 1222)


De Francisci Morales, Gianmarco

Learning to Question: Leveraging User Preferences for Shopping Advice (Page 203)


Deng, Alex

Online Controlled Experiments at Large Scale (Page 1168)


Deng, Xinwei

Robust Sparse Estimation of Multiresponse Regression and Inverse Covariance Matrix via the L2 Distance (Page 293)


Desai, Nihit

A Phrase Mining Framework for Recursive Construction of a Topical Hierarchy (Page 437)

AMETHYST: A System for Mining and Exploring Topical Hierarchies of Heterogeneous Data (Page 1458)

EventCube: Multi-Dimensional Search and Mining of Structured and Text Data (Page 1494)


Dhillon, Inderjit S.

Research Track Program Chairs' Welcome Message


Dhurandhar, Amit

Improving Quality Control by Early Prediction of Manufacturing Outcomes (Page 1258)


Di, Wei

Palette Power: Enabling Visual Search Through Colors (Page 1321)


Di-Castro, Dotan

Model Selection in Markovian Processes (Page 374)


Dietterich, Thomas G.

Detecting Insider Threats in a Real Corporate Database of Computer Usage Activity (Page 1393)


Ding, Bolin

EventCube: Multi-Dimensional Search and Mining of Structured and Text Data (Page 1494)


Ding, Chris

On the Equivalent of Low-Rank Linear Regressions and Linear Discriminant Analysis Based Regressions (Page 1124)


Ding, Hao

Collaborative Matrix Factorization with Multiple Similarities for Predicting Drug-Target Interactions (Page 1025)


Ding, Wei

Constrained Stochastic Gradient Descent for Large-Scale Least Squares Problem (Page 883)

Towards Long-Lead Forecasting of Extreme Flood Events: A Data Mining Framework for Precipitation Cluster Precursors Identification (Page 1285)


Dou, Liyu

Trial and Error in Influential Social Networks (Page 1016)


Duan, Bing

FIU-Miner: A Fast, Integrated, and User-Friendly System for Data Mining in Distributed Environment (Page 1506)


DuBois, Christopher

Stochastic Collapsed Variational Bayesian Inference for Latent Dirichlet Allocation (Page 446)


DuMouchel, William

Empirical Bayes Model to Combine Signals of Adverse Drug Reactions (Page 1339)


Ebner, Dietmar

Ad Click Prediction: A View from the Trenches (Page 1222)


Eckles, Dean

Uncertainty in Online Experiments with Dependent Data: An Evaluation of Bootstrap Methods (Page 1303)


Eftekhar, Milad

Information Cascade at Group Scale (Page 401)


El-Arini, Khalid

Representing Documents Through Their Readers (Page 14)


Eliassi-Rad, Tina

Guided Learning for Role Discovery (GLRD): Framework, Algorithms, and Applications (Page 113)

Mining Data from Mobile Devices: A Survey of Smart Sensing and Analytics (Page 1524)


Emerson, Daniel

A Data Mining Driven Risk Profiling Method for Road Asset Management (Page 1267)


Emmott, Andrew

Detecting Insider Threats in a Real Corporate Database of Computer Usage Activity (Page 1393)


Erdös, Dóra

Repetition-Aware Content Placement in Navigational Networks (Page 820)


Essa, Irfan

Detecting Insider Threats in a Real Corporate Database of Computer Usage Activity (Page 1393)


Etzioni, Oren

To Buy or Not to Buy - That Is the Question (Page 1133)