I am generally interested in text information management, which includes web search and text mining. My main focus is contextual search, using explicit/implicit user feedback to learn a better model of the user's information need, so that search results can be personalized to achieve higher retrieval accuracy.
| Sept. 2003 - Present |
University of Illinois at Urbana-Champaign Ph.D. candidate in Computer Science M.S. in Computer Science Adviser: ChengXiang Zhai GPA: 4.0 |
| Sept. 1999 - June 2003 |
Nanjing University, China B.S. in Computer Science GPA: 3.9 |
| June 2005 - Present |
Research Assistant for Prof. ChengXiang Zhai Projects:
|
| Summer 2008 |
Intern at Google, Inc. Mentor: Jon Trowbridge I developed system tools to process user data for a Google hosting service. |
| Summer 2007 |
Intern at Yahoo, Inc. Mentor: Fuchun Peng I developed algorithms to segment web search queries to semantic units using language modeling and Wikipedia. A WWW'08 paper is produced based on this work. |
| Summer 2006 |
Intern at Nextumi, Inc. (now ShareThis, Inc.) Supervisor: David Goldberg I worked on text mining and information retrieval algorithms for the company's social network software. This internship is followed by a 1-year research assistantship for and sponsored by the company. |
| Sept. 2003 - May 2005 |
Teaching Assistant for CS423 Operating Systems Design
My duty included designing/grading assignments/exams, holding office hours and giving guest lectures. |
Bin Tan, Fuchun Peng, Unsupervised Query Segmentation using Generative Language Models and Wikipedia. In Proceedings of the 17th International World Wide Web Conference (WWW'08). (11% acceptance) [PDF]
Bin Tan, Atulya Velivelli, Hui Fang, ChengXiang Zhai, Term Feedback for Information Retrieval with Language Models. In Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR'07), pages 263-270. (18% acceptance) [PDF]
Bin Tan, Xuehua Shen, ChengXiang Zhai, Mining long-term search history to improve search accuracy. In Proceedings of the 2006 ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'06), pages 718-723. (poster paper, 23% acceptance) [PDF]
Xuehua Shen, Bin Tan, ChengXiang Zhai, Context-Sensitive Information Retrieval with Implicit Feedback. In Proceedings of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR'05), pages 43-50. (19% acceptance) [PDF]
Xuehua Shen, Bin Tan, and ChengXiang Zhai, Implicit User Modeling for Personalized Search. In Proceedings of the 14th ACM International Conference on Information and Knowledge Management (CIKM'05), pages 824-831. (18% acceptance) [PDF]
Xuehua Shen, Bin Tan, and ChengXiang Zhai, Privacy Protection in Personalized Search. In ACM SIGIR Forum, vol 41(1), pages 4-17. [PDF]
Xuehua Shen, Bin Tan, ChengXiang Zhai, UCAIR Toolbar: A Personalized Search Toolbar. In Proceedings of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR'05), page 681. (Demo)
Xuehua Shen, Bin Tan, ChengXiang Zhai, UCAIR: Capturing and Exploiting Context for Personalized Search. In Proceedings of the Second Workshop on Information Retrieval in Context (IRiX'05). [PDF]
Bin Tan, Hui Fang, Atulya Velivelli, Chengxiang Zhai, Interactive Construction of Query Language Models - UIUC TREC 2005 HARD Track Experiments. In Proceedings of the 14th Text REtrieval Conference (TREC'05). [PDF]