Selected Recent Publications
2012
- Huizhong Duan, Emre Kiciman, ChengXiang Zhai,
Click Patterns: An Empirical Representation of Complex Query Intents ,
Proceedings of the 21st ACM International Conference on Information and Knowledge Management (CIKM'12), to appear. pdf
- Yue Lu, Hongning Wang, ChengXiang Zhai, Dan Roth,
Unsupervised Discovery of Opposing Opinion Networks From Forum Discussions,
Proceedings of the 21st ACM International Conference on Information and Knowledge Management (CIKM'12), to appear. pdf
- Bin Tan, Yuanhua Lv, ChengXiang Zhai,
Mining long-lasting exploratory user interests from search history,
Proceedings of the 21st ACM International Conference on Information and Knowledge Management (CIKM'12), to appear. pdf
- Yuanhua Lv, ChengXiang Zhai,
Query Likelihood with Negative Query Generation,
Proceedings of the 21st ACM International Conference on Information and Knowledge Management (CIKM'12), to appear. pdf
- V.G.Vinod Vydiswaran, ChengXiang Zhai, Dan Roth, and Peter Pirolli,
BiasTrust: Teaching biased users about controversial topic,
Proceedings of the 21st ACM International Conference on Information and Knowledge Management (CIKM'12), to appear.
- Huizhong Duan, Yanen Li, ChengXiang Zhai and Dan Roth,
A Discriminative Model for Query Spelling Correction with Latent Structural SVM,
Proceedings of EMNLP-CoNLL 2012 (EMNLP'12), pages 1511-1521, 2012.
- Yanen Li, Huizhong Duan, ChengXiang Zhai,
A Generalized Hidden Markov Model with Discriminative Training for
Query Spelling Correction , Proceedings of ACM SIGIR 2012 (SIGIR'12), pages 611-620, 2012. pdf
- Parikshit Sondhi, Jimeng Sun, Hanghang Tong, ChengXiang Zhai,
SympGraph: A Mining Framework of Clinical Notes through Symptom Relation Graphs, Proceedings
of KDD 2012 (KDD'12), pages 1167-1175, 2012.
- Parikshit Sondhi, Jimeng Sun, ChengXiang Zhai, Robert Sorrentino and Martin S. Kohn,
Leveraging Medical Thesauri and Physician Feedback for Improving Medical Literature Retrieval for Case Queries,
Journal of American Medical Informatics Association (JAMIA), 19(5): 851-858 (2012).
- Kavita Ganesan, Chengxiang Zhai and Evelyne Viegas,
Micropinion Generation: An Unsupervised Approach to Generating Ultra-Concise Summaries of Opinions,
Proceedings of the World Wide Conference 2012 ( WWW'12), pages 869-878, 2012. (acceptance rate 12%) pdf
- Alex Kotov, ChengXiang Zhai,
Tapping into Knowledge Base for Concept Feedback: Leveraging ConceptNet to Improve Search Results for Difficult Queries,
Proceedings of the 5th ACM International Conference on Web Search and Data Mining (WSDM'12), pages 403-412, 2012. (acceptance rate 21%)
-
Shima Gerani, ChengXiang Zhai, Fabio Crestani,
Score Transformation in Linear Combination for Multi-Criteria Relevance Ranking ,
Proceedings of the 34th European Conference on Information Retrieval (ECIR'12), pages 256-267, 2012. (acceptance rate 21%)
-
Parikshit Sondhi, V.G.Vinod Vydiswaran, ChengXiang Zhai,
Reliability Prediction of Webpages in the Medical Domain,
Proceedings of the 34th European Conference on Information Retrieval (ECIR'12), pages 219-231, 2012.(acceptance rate 21%) pdf
-
Maryam Karimzadehgan, Chengxiang Zhai,
Axiomatic Analysis of Translation Language Model For Information Retrieval ,
Proceedings of the 34th European Conference on Information Retrieval (ECIR'12), pages 268-280, 2012. (acceptance rate 21%)
-
Yuanhua Lv, Chengxiang Zhai,
A Log-logistic Model-based Interpretation of TF Normalization of BM25,
Proceedings of the 34th European Conference on Information Retrieval (ECIR'12), pages 244-255, 2012. (acceptance rate 21%)
-
Kavita Ganesan, ChengXiang Zhai, Opinion-based Entity Ranking, Information Retrieval, 15(2): 116-150 (2012) pdf
2011
-
Duo Zhang, ChengXiang Zhai, Jiawei Han,
MiTexCube: MicroTextCluster Cube for Online Analysis of Text Cells,
Proceedings of NASA Conference on Intelligent Data Understanding 2011, to appear.
- Alexander Kotov, ChengXiang Zhai,
An Exploration of the Potential Effectiveness of Interactive Sense Feedback for Difficult Queries,
Proceedings of the 20th ACM International Conference on Information and Knowledge Management (CIKM'11), pages 163-172, 2011.
-
Yuanhua Lv, ChengXiang Zhai,
Lower Bounding Term Frequency Normalization,
Proceedings of the 20th ACM International Conference on Information and Knowledge Management (CIKM'11), pages 7-16, 2011. ( Best Student Paper Award) pdf
- Huizhong Duan, Rui Li, ChengXiang Zhai,
Automatic Query Reformulation with Syntactic Operators to Alleviate Search Difficulty,
Proceedings of the 20th ACM International Conference on Information and Knowledge Management (CIKM'11), poster paper, pages 2037-2040, 2011.
-
Yuanhua Lv, ChengXiang Zhai,
Adaptive Term Frequency Normalization for BM25,
Proceedings of the 20th ACM International Conference on Information and Knowledge Management (CIKM'11), poster paper, pages 1985-1988, 2011.
-
Maryam Karimzadehgan, ChengXiang Zhai,
Improving Retrieval Accuracy of Difficult Queries through Generalizing Negative Document Language Models,
Proceedings of the 20th ACM International Conference on Information and Knowledge Management (CIKM'11), pages 27-36, 2011.
- Hongning Wang, Yue Lu, ChengXiang Zhai,
Latent Aspect Rating Analysis without Aspect Keyword Supervision,
Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'11), 2011, pages 618-626. ( 17.5% acceptance)
- V.G.Vinod Vydiswaran, ChengXiang Zhai, Dan Roth,
Content-driven Trust Propagation Framework ,
Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'11), 2011, pages 974-982. ( 17.5% acceptance)
-
Hongning Wang, Chi Wang, ChengXiang Zhai, Jiawei Han,
Learning Online Discussion Structures by Conditional Random Fields,
Proceedings of the 34th Annual International ACM SIGIR Conference on Research and
Development in Information Retrieval ( SIGIR'11 ), 2011, pages 435-444. ( 20% acceptance)
pdf
- Yanen Li, Bo-June Hsu, ChengXiang Zhai, Kuansan Wang,
Unsupervised Query Segmentation Using Clickthrough for Information Retrieval,
Proceedings of the 34th Annual International ACM SIGIR Conference on Research and
Development in Information Retrieval ( SIGIR'11 ), 2011, pages 285-294. ( 20% acceptance)
- Yuanhua Lv, ChengXiang Zhai, Wan Chen,
A Boosting Approach to Improving Pseudo-Relevance Feedback,
Proceedings of the 34th Annual International ACM SIGIR Conference on Research and
Development in Information Retrieval ( SIGIR'11 ), 2011, pages 165-174. ( 20% acceptance) pdf
- Hongning Wang, Duo Zhang, ChengXiang Zhai, Structural Topic Model for Latent Topical Structure Analysis,
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies (ACL HTL'11), to appear.
pdf
- Yue Lu, Malu Castellanos, Umeshwar Dayal, ChengXiang Zhai, Automatic Construction of a Context-Aware Sentiment
Lexicon: An Optimization Approach,
Proceedings of the World Wide Conference 2011 ( WWW'11), pages 347-356. pdf
- Zhijun Yin, Liangliang Cao, Jiawei Han, Chengxiang Zhai, and Thomas Huang, Geographical Topic Discovery and Comparison,
Proceedings of the World Wide Conference 2011 ( WWW'11), pages 247-256. pdf
-
Huizhong Duan and Chengxiang Zhai, Exploiting Thread Structure to Improve Smoothing of Language Models for Forum Post Retrieval, Proceedings of the 33rd European Conference on Information Retrieval (ECIR'11), to appear. pdf
- Alex Kotov, ChengXiang Zhai, Richard Sproat, Mining Named Entities with Temporally Correlated Bursts from Multilin
gual Web News Streams, Proceedings of WSDM 2011, to appear.
- Hui Fang, Tao Tao, ChengXiang Zhai, Diagnostic Evaluation of Information Retrieval
Models, ACM Transactions on Information Systems (ACM TOIS), to appear. pdf
- Yue Lu, Qiaozhu Mei, ChengXiang Zhai.
Investigating Task Performance of Probabilistic Topic Models - An Empirical Study of PLSA and LDA,
Information Retrieval, vol. 14, no. 2, April, 2011.
2010
- Yanen Li, Jia Hu, ChengXiang Zhai, Ye Chen.
Improving One-Class Collaborative Filtering by Incorporating Rich User Information,
Proceedings of the 19th ACM International Conference on Information and Knowledge Management (CIKM'10), pages 959-968, 2010. ( 13.4% acceptance) pdf
-
Michael J. Paul, ChengXiang Zhai and Roxana Girju.
Summarizing Contrastive Viewpoints In Opinionated Text,
Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing (EMNLP'10), pages 65-75, 2010. ( 25% acceptance) pdf
- Kavita Ganesan, ChengXiang Zhai, Jiawei Han.
Opinosis: A Graph Based Approach to Abstractive Summarization of Highly Redundant Opinions, Proceedings of COLING 2010, pages 340-348. pdf
-
Yue Lu, Huizhong Duan, Hongning Wang and ChengXiang Zhai.
Exploiting Structured Ontology to Organize Scattered Online Opinions,
Proceedings of COLING 2010, pages 734-742. pdf
-
Parikshit Sondhi, Manish Gupta, ChengXiang Zhai and Julia Hockenmaier.
Shallow Information Extraction from Medical Forum Data,
Proceedings of COLING 2010, pages 1158-1166. pdf
- Hongning Wang, Yue Lu, ChengXiang Zhai.
Latent Aspect Rating Analysis on Review Text Data: A Rating Regression Approach, Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'10), pages 115-124, 2010. pdf
- Xin He, Yanen Li, Radhika Khetani, Barry Sanders, Yue Lu, Xu Ling, ChengXiang Zhai, Bruce Schatz.
BSQA: integrated text mining using entity relation semantics extracted from biological literature of insects, Nucleic Acids Research . download
- Xin He, Moushumi Sen Sarma, Xu Ling, Brant Chee, ChengXiang Zhai, Bruce Schatz.
Identifying overrepresented concepts in gene lists from
literature: a statistical approach based on Poisson mixture
model,
BMC Bioinformatics 2010, 11:272 (20 May 2010). download
- Duo Zhang, Qiaozhu Mei, ChengXiang Zhai.
Cross-Lingual Latent Topic Extraction,
Proceedings of the 48th Annual Meeting of the Association for
Computational Linguistics ( ACL'10), pages 1128-1137, 2010. pdf
- Maryam Karimzadehgan, ChengXiang Zhai,
Estimation of Statistical Translation Models Based on Mutual Information for Ad Hoc Information Retrieval ,
Proceedings of the 33rd Annual International ACM SIGIR Conference on Research and
Development in Information Retrieval ( SIGIR'10 ), pages 323-330, 2010.
( 16.7% acceptance) pdf
- Yuanhua Lv, ChengXiang Zhai, Positional Relevance Model for Pseudo-Relevance Feedback ,
Proceedings of the 33rd Annual International ACM SIGIR Conference on Research and
Development in Information Retrieval ( SIGIR'10 ), pages 579-586, 2010.
( 16.7% acceptance) pdf
- Alexander Kotov, ChengXiang Zhai, Towards Natural Question-Guided Search,
Proceedings of the World Wide Conference 2010 ( WWW'10), pages 541-550. pdf
- Hyun Duk Kim, ChengXiang Zhai, Jiawei Han, Aggregation of Multiple Judgments for
Evaluating Ordered Lists,
Proceedings of the 32nd European Conference on Information Retrieval (ECIR'10), pages 166-178, 2010. (22% acceptance) pdf
2009
- Xuanhui Wang, Bin Tan, Azadeh Shakery, ChengXiang Zhai, Beyond Hyperlinks: Organizing Information Footprints in Search Logs to Support Effective Browsing,
Proceedings of the 18th ACM International Conference on Information and Knowledge Management ( CIKM'09), pages 1237-1246, 2009.
( full paper, 14.5% acceptance) pdf
-
Hyun Duk Kim, ChengXiang Zhai, Generating Comparative Summaries of Contradictory Opinions in Text,
Proceedings of the 18th ACM International Conference on Information and Knowledge Management ( CIKM'09), pages 385-394, 2009.
( full paper, 14.5% acceptance) pdf
- Yuanhua Lv, ChengXiang Zhai, Adaptive Relevance Feedback in Information Retrieval,
Proceedings of the 18th ACM International Conference on Information and Knowledge Management ( CIKM'09), pages 255-264, 2009.
( full paper, 14.5% acceptance) pdf
- Yuanhua Lv, ChengXiang Zhai, Positonal Language Models for Information Retrieval,
Proceedings of the 32nd Annual International ACM SIGIR Conference on Research and
Development in Information Retrieval ( SIGIR'09 ), pages 299-306, 2009.
( 16% acceptance) pdf
- Younhee Ko, ChengXiang Zhai, Sandra Rodriguez-Zas, Inference of Gene Pathways using Mixture Bayesian Networks,
BMC Systems Biology, 3:54, 2009, doi:10.1186/1752-0509-3-54. pdf.
- Duo Zhang, ChengXiang Zhai, Jiawei Han, Topic Cube: Topic Modeling for OLAP on Multidimensional Text Databases,
Proceedings of 2009 SIAM International Conference on Data Mining (SDM'09), pages 1123-1134, 2009. ( 16% acceptance)
pdf
- Yue Lu, ChengXiang Zhai, Neel Sundaresan, Rated Aspect Summarization of Short Comments,
Proceedings of the World Wide Conference 2009 ( WWW'09), pages 131-140.
( 12% acceptance) pdf
- Yue Lu, Hui Fang, ChengXiang Zhai, An Empirical Study of Gene Synonym
Query Expansion in Biomedical Information Retrieval, Information Retrieval, Volume 12, Number 1, Feb. 2009, Pages 51-68.
link
2008
- ChengXiang Zhai, Statistical Language Models for Information Retrieval: A Critical Review, Foundations and Trends in Information Retrieval, Vol. 2, No. 3 (2008), pages 137-215, doi:10.1561/1500000008. pdf
- ChengXiang Zhai, Statistical Language Models for Information Retrieval (Synthesis Lectures Series on Human Language Technologies), Morgan & Claypool Publishers, 2008. PDF, Amazon page
- Bo Jin, Brian Muller, ChengXiang Zhai, Xinghua Lu, Multi-label literature classification based on the Gene Ontology graph,
BMC Bioinformatics, 2008, 9:525, doi:10.1186/1471-2105-9-525.
- Maryam Karimzadehgan, ChengXiang Zhai, Geneva Belford, Multi-Aspect Expertise Matching
for Review Assignment,
Proceedings of the 17th ACM International Conference on Information and Knowledge Management ( CIKM'08), pages 1113-1122.
(17% acceptance)
- Xuanhui Wang, ChengXiang Zhai, Mining term association patterns from search logs for effective query reformulation,
Proceedings of the 17th ACM International Conference on Information and Knowledge Management ( CIKM'08), pages 479-488.
(17% acceptance)
-
Deng Cai, Qiaozhu Mei, Jiawei Han, ChengXiang Zhai,
Modeling Hidden Topics on Document Manifold ,
Proceedings of the 17th ACM International Conference on Information and Knowledge Management ( CIKM'08), pages 911-920.
(17% acceptance)
- Xu Ling, Qiaozhu Mei, ChengXiang Zhai, Bruce R. Schatz, Mining multi-faceted overviews of arbitrary topics in a text collection,
Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'08), pages 497-505, 2008.
( 20% acceptance)
- Qiaozhu Mei, Duo Zhang, ChengXiang Zhai.
Smoothing Language Models with Document and Word Graphs ,
Proceedings of the 31st Annual International ACM SIGIR Conference on Research and
Development in Information Retrieval ( SIGIR'08 ), pages 611-618.
( 17% acceptance)
- Xuanhui Wang, Hui Fang, ChengXiang Zhai.
A study of methods for negative relevance feedback ,
Proceedings of the 31st Annual International ACM SIGIR Conference on Research and
Development in Information Retrieval ( SIGIR'08 ), pages 219-226.
( 17% acceptance)
- Qiaozhu Mei, ChengXiang Zhai.
Generating Impact-Based Summaries for Scientific Literature ,
Proceedings of the 46th Annual Meeting of the Association for
Computational Linguistics: Human Language Technologies ( ACL-08:HLT), pages 816-824. (25% acceptance)
- Yue Lu, ChengXiang Zhai.
Opinion Integration Through Semi-supervised Topic
Modeling,
Proceedings of the World Wide Conference 2008 ( WWW'08), pages 121-130. ( 12% acceptance) pdf.
- Qiaozhu Mei, Deng Cai, Duo Zhang, ChengXiang Zhai.
Topic Modeling with Network Regularization,
Proceedings of the World Wide Conference 2008 ( WWW'08), pages 101-110. (12% acceptance) pdf.
- Azadeh Shakery, ChengXiang Zhai.
Smoothing Document Language Models with Probabilistic
Term Count Propagation, Information Retrieval,
11(2), 2008, pages 139-164.
- Xuanhui Wang, Tao Tao, Jian-Tao Sun, Azadeh Shakery, and ChengXiang Zhai, DirichletRank:
Solving the Zero-One Gap Problem of PageRank, ACM Transactions on Information Systems,
26(2), 2008, Article No. 10.
2007
-
- Qiaozhu Mei, Dong Xin, Hong Cheng, Jiawei Han, and ChengXiang Zhai, Semantic Annotation of
Frequent Patterns, ACM Transactions on Knowledge Discovery from Data,
1(3), Dec. 2007, Article No. 11.
- Jing Jiang, ChengXiang Zhai, A Two-Stage Approach to Domain Adaptation for Statistical Classifiers ,
Proceedings of the 16th ACM International Conference on Information and Knowledge Management ( CIKM'07), pages 401-410.
( full paper, 17% acceptance)
- Xuanhui Wang, Hui Fang, ChengXiang Zhai, Improve Retrieval Accuracy for Difficult Queries using Negative Feedback ,
Proceedings of the 16th ACM International Conference on Information and Knowledge Management ( CIKM'07), pages 991-994.
( short paper, 26% acceptance)
- Shui-Lung Chuang, Kevin Chen-Chuan Chuang, and ChengXiang Zhai,
Context-Aware Wrapping: Synchronized Data Extraction,
Proceedings of the 33rd Very Large Data Bases Conference (VLDB'07),pages 699-710. (17.5% acceptance)
- Xuehua Shen, Bin Tan, and ChengXiang Zhai, Privacy Protection in Personalized Search,
ACM SIGIR Forum , 41(1), pages 4-17. pdf
- Qiaozhu Mei, Xuehua Shen, and ChengXiang Zhai, Automatic Labeling of Multinomial Topic Models ,
Proceedings of the 2007 ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'07 ), pages 490-499. ( 19% acceptance ) pdf
- Xuanhui Wang, ChengXiang Zhai, Xiao Hu, and Richard Sproat, Mining Correlated Bursty Topic Patterns from Coordinated Text Streams ,
Proceedings of the 2007 ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'07 ), pages 784-793. (19% acceptance rate) pdf
- Xuanhui Wang, ChengXiang Zhai, Learn from Web Search Logs to
Organize Search Results,
Proceedings of the 30th Annual International ACM SIGIR Conference on Research and
Development in Information Retrieval ( SIGIR'07 ), pages 87-94. ( 18% acceptance) pdf
- Bin Tan, Atulya Velivelli, Hui Fang, ChengXiang Zhai,
Term Feedback for Information Retrieval with Language Models,
Proceedings of the 30th Annual International ACM SIGIR Conference on Research and
Development in Information Retrieval ( SIGIR'07 ), pages 263-270. ( 18% acceptance) pdf
- Qiaozhu Mei, Hui Fang, ChengXiang Zhai,
A Study of Poisson Query Generation Model for Information Retrieval,
Proceedings of the 30th Annual International ACM SIGIR Conference on Research and
Development in Information Retrieval ( SIGIR'07 ), pages 319-326. ( 18% acceptance) pdf
- Tao Tao, ChengXiang Zhai, An Exploration of Proximity Measures in Information Retrieval,
Proceedings of the 30th Annual International ACM SIGIR Conference on Research and
Development in Information Retrieval ( SIGIR'07 ), pages 295-302. ( 18% acceptance) pdf
- Jing Jiang and ChengXiang Zhai, An Empirical Study of
Tokenization Strategies for Biomedical Information Retrieval, Information Retrieval,
10(4-5), Oct. 2007, pp. 341-363. pdf
- Jing Jiang and ChengXiang Zhai, Instance Weighting for Domain Adaptation in NLP,
Proceedings of ACL 2007, pages 264-271. pdf
- Qiaozhu Mei, Xu Ling, Matthew Wondra, Hang Su, ChengXiang Zhai, Topic Sentiment Mixture: Modeling Facets and Opinions in Weblogs, Proceedings of the World Wide Conference 2007 ( WWW'07), pages 171-180. pdf
- Hui Fang, ChengXiang Zhai, Probabilistic Models for Expert Finding , Proceedings of
the 29th European Conference on Information Retrieval (ECIR'07), pages 418-430. ( 19% acceptance) pdf
- Jing Jiang, ChengXiang Zhai,
A Systematic Exploration of The Feature Space for Relation Extraction
,
Proceedings of Human Language Technologies: The Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL-HLT 2007), pages 113-120. ( 24% acceptance) pdf
- Xu Ling, Jing Jiang, Xin He, Qiaozhu Mei, ChengXiang Zhai, Bruce Schatz,
Generating Semi-Structured Gene Summaries from Biomedical Literature,
Information Processing and Management, 43(6), Nov. 2007, pp. 1777-1791.
pdf
2006
- Saurabh Sinha, Xu Ling, Charles W. Whitfield, ChengXiang Zhai, and Gene E. Robinson,
Genome scan for cis-regulatory DNA motifs associated with social behavior in honey bees ,
Proceedings of National Academy of Sciences of the United States of America (PNAS) ,
103(44), Oct. 2006, pages 16352-16357. URL
- Jing Jiang and ChengXiang Zhai,
Extraction of coherent relevant passages
using hidden Markov models, ACM Transactions on Information
Systems, 24(3), July 2006, pages 295-319. URL
- Azadeh Shakery and ChengXiang Zhai,
A probabilistic relevance propagation model for hypertext retrieval,
In Proceedings of the 15th ACM International Conference on Information and Knowledge Management ( CIKM'06), pages 550-558. ( 15% acceptance) pdf
- Rong Jin, Luo Si, and ChengXiang Zhai,
A study of mixture models for collaborative filtering, Information Retrieval,
9(3), Jun. 2006, pages 357-382. URL
- Bin Tan, Xuehua Shen, ChengXiang Zhai,
Mining long-term search history to improve search
accuracy ,
Proceedings of the 2006 ACM SIGKDD International Conference on Knowledge Discovery and Data Mining , (KDD'06 ), pages 718-723. (poster paper, 23% acceptance) pdf
- Qiaozhu Mei, ChengXiang Zhai,
A Mixture Model for Contextual Text Mining,
Proceedings of the 2006 ACM SIGKDD International Conference on Knowledge Discovery and Data Mining , (KDD'06 ), pages 649-655. (poster paper, 23% acceptance) pdf
- Qiaozhu Mei, Dong Xin, Hong Cheng, Jiawei Han, ChengXiang Zhai,
Generating Semantic Annotations for Frequent Patterns
with Context Analysis ,
Proceedings of the 2006 ACM SIGKDD International Conference on Knowledge Discovery and Data Mining , (KDD'06 ), pages 337-346. Best Student Paper Award Runner-Up.
(full paper, 11% acceptance) pdf
- Tao Tao, Su-Youn Yoon, Andrew Fister, Richard Sproat and ChengXiang Zhai,
Unsupervised Named Entity Transliteration Using Temporal and Phonetic Correlation ,
Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing (EMNLP 2006), pages 250-257. ( 31% acceptance) pdf
- Richard Sproat, Tao Tao and ChengXiang Zhai,
Named Entity Transliteration with Comparable Corpora,
Proceedings of COLING-ACL 2006, pages 73-80. ( 23% acceptance) pdf
- Xuanhui Wang, Jian-Tao Sun, Zheng Chen, ChengXiang Zhai,
Latent Semantic Analysis for Multiple-Type Interrelated Data Objects
Proceedings of the 29th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval ( SIGIR'06 ), pages 236-243. ( 19% acceptance) pdf
- Hui Fang, ChengXiang Zhai,
Semantic Term Matching in Axiomatic Approaches to Information Retrieval
Proceedings of the 29th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval ( SIGIR'06 ), pages 115-122. ( 19% acceptance) pdf
- Tao Tao, ChengXiang Zhai,
Regularized Estimation of Mixture Models for Robust Pseudo-Relevance Feedback
Proceedings of the 29th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval ( SIGIR'06 ), pages 162-169. ( 19% acceptance) pdf
- Jing Jiang, ChengXiang Zhai,
Exploiting Domain Structure for Named Entity Recognition.
Proceedings of HLT/NAACL 2006, pages 74-81. ( 25% acceptance) pdf, ppt
- Tao Tao, Xuanhui Wang, Qiaozhu Mei, ChengXiang Zhai,
Language Model Information Retrieval with Document Expansion.
Proceedings of HLT/NAACL 2006, pages 407-414. ( 25% acceptance) pdf
- Qiaozhu Mei, Chao Liu, Hang Su, and ChengXiang Zhai,
A Probabilistic Approach to Spatiotemporal Theme Pattern Mining on Weblogs.
Proceedings of the
World Wide Web Conference 2006 ( WWW'06), pages 533-542. (11% acceptance) pdf
- Xu Ling, Jing Jiang, Xin He, Qiaozhu Mei, ChengXiang Zhai, and Bruce Schatz,
Automatically Generating Gene Summaries from Biomedical Literature . In Proceedings of
Pacific Symposium on Biocomputing 2006 (PSB'06), pages 40-51.
pdf
- ChengXiang Zhai and John Lafferty,
A risk minimization framework for information retrieval ,
Information Processing and Management ( IP &M ), 42(1), Jan. 2006. pages 31-55.
URL
2005
- Xuehua Shen, Bin Tan, and ChengXiang Zhai, Implicit User Modeling for Personalized Search ,
In Proceedings of the 14th ACM International Conference on Information and Knowledge Management ( CIKM'05), pages 824-831.
pdf ( 18% acceptance)
- Qiaozhu Mei, ChengXiang Zhai, Discovering Evolutionary Theme Patterns from Text -- An Exploration of Temporal Text Mining,
Proceedings of the 2005 ACM SIGKDD International Conference on Knowledge Discovery and Data Mining , (KDD'05 ), pages 198-207, 2005. pdf
(full paper, 12% acceptance)
- Tao Tao, ChengXiang Zhai, Mining Comparable Bilingual Text Corpora for Cross-Language Information Integration ,
Proceedings of the 2005 ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'05 ), pages 691-696, 2005. pdf ( poster paper, 22% acceptance)
- Hui Fang, ChengXiang Zhai, An Exploration of Axiomatic Approach to Information Retrieval ,
Proceedings of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval ( SIGIR'05 ), 480-487, 2005.
pdf ( 19% acceptance)
- Xuehua Shen, ChengXiang Zhai, Active Feedback in Ad Hoc Information Retrieval,
Proceedings of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval ( SIGIR'05), 59-66, 2005.
pdf ( 19% acceptance )
- Xuehua Shen, Bin Tan, ChengXiang Zhai, Context-Sensitive Information Retrieval with Implicit Feedback,
Proceedings of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval ( SIGIR'05), 43-50, 2005.
pdf ( 19% acceptance )
2004
-
Tao Tao, ChengXiang Zhai, Xinghua Lu, and Hui Fang, A study of statistical methods for function prediction of protein motifs , Applied Bioinformatics, Volume 3, No. 2-3, pages 115-124. (BLM 03 paper: ps, pdf)
-
Xinghua Lu, Chengxiang Zhai , Vanathi Gopalakrishnan, and Bruce G Buchanan,
Automatic annotation of protein motif function with Gene Ontology terms, BMC Bioinformatics 2004, 5:122. (url) (Impact factor=5.42, as of 2006)
- Hui Fang, Tao Tao, ChengXiang Zhai, A formal study of information retrieval heuristics,
Proceedings of the 27th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval ( SIGIR'04), pages 49-56, 2004. Best Paper Award. pdf ( 22% acceptance )
- ChengXiang Zhai, Atulya Velivelli, Bei Yu, A cross-collection mixture model for comparative text mining, Proceedings of ACM KDD 2004 ( KDD'04 ), pages 743-748, 2004. pdf, ppt ( poster paper, 25% acceptance )
- Tao Tao, ChengXiang Zhai, A Mixture Clustering Model for Pseudo Feedback in Information Retrieval ,
Proceedings of the 2004 Meeting of the International Federation of Classification Societies ( IFCS'04), pages 541-552. Invited Paper. pdf
- ChengXiang Zhai, John Lafferty, A study of smoothing methods for language models applied to information retrieval , ACM Transactions on Information Systems ( ACM TOIS ), Vol. 22, No. 2, April 2004, pages 179-214. ( ps)
2003
-
Hwanjo Yu, ChengXiang Zhai, and Jiawei Han,
Text Classification from Positive and Unlabeled Documents , Proceedings of ACM CIKM 2003 (CIKM'03), pages 232-239, 2003. pdf ( 15% acceptance )
- Jin Rong, Luo Si, ChengXiang Zhai, and Jamie Callan,
Collaborative Filtering with Decoupled Models for Preferences and Ratings ,
Proceedings of ACM CIKM 2003 (CIKM'03 ), pages 301-316, 2003. ps, pdf ( 15% acceptance)
- ChengXiang Zhai, William W. Cohen, and John Lafferty, Beyond Independent Relevance: Methods and Evaluation Metrics for Subtopic Retrieval ,
Proceedings of the 26th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval ( SIGIR'03 ), pages 10-17, 2003.
ps, pdf ( 17% acceptance )
- Rong Jin, Luo Si, and ChengXiang Zhai, Preference-based Graphic Models for Collaborative Filtering, In Proceedings of UAI 2003 (UAI'03 ), pages 329-336, 2003. ps, pdf ( 25% acceptance )
- John Lafferty and Chengxiang Zhai, Probabilistic relevance models based on document and query generation , In Language Modeling and Information Retrieval, Kluwer International Series on Information Retrieval, Vol. 13, 2003. ps,
pdf
2002
- ChengXiang Zhai, Risk Minimization and Language Modeling in Information Retrieval, Ph.D. thesis, Carnegie Mellon University, 2002. (summary).
- ChengXiang Zhai and John Lafferty, Two-Stage Language Models for Information Retrieval ,
Proceedings of the 25th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval ( SIGIR'02), pages 49-56, 2002.
ps, pdf ( 20% acceptance )
- Rong Jin, Alex G. Hauptmann, and ChengXiang Zhai, Title Language
Model for Information Retrieval,
Proceedings of the 25th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval ( SIGIR'02 ), pages 42-48, 2002.
ps,
pdf ( 20% acceptance )
2001
- Chengxiang Zhai and John Lafferty, Model-based feedback in the language modeling approach to information retrieval , Proceedings of the Tenth ACM International Conference on Information and Knowledge Management (CIKM'01), pages 403-410, 2001. ps,
pdf ( 25% acceptance)
- Chengxiang Zhai and John Lafferty, A study of smoothing methods for
language models applied to ad hoc information retrieval,
Proceedings of the 24th Annual International ACM SIGIR
Conference on Research and Development in Information Retrieval (SIGIR'01 ), pages 334-342, 2001. ps, pdf
( 23% acceptance )
- John Lafferty and Chengxiang Zhai, Document language models, query models, and risk minimization for information
retrieval ,
Proceedings of the 24th Annual International ACM SIGIR
Conference on Research and Development in Information Retrieval (SIGIR'01 ), pages 111-119, 2001. ps,
pdf ( 23% acceptance )