Gensheng Zhang
Research Interests
- Databases
- Data Mining, Graph Mining
- Computational Journalism
- Crowdsourcing
Skills
About me
I am a CSc Ph.D. student at UTA. I started this program in Sept. 2012. I received an M.Sc. degree in CSc from SDSU in 2012. Prior to that, I had worked as a software engineer at Revenco for a few years, right after I obtained a B.E. degree in Information Security from Wuhan University in 2006.
Currently, I am working in the Innovative Database and Information Systems Research Lab, under supervision of Dr. Chengkai Li.
Education
The Univeristy of Texas at Arlington
South Dakota State University
Wuhan University
Research experiences
The Univeristy of Texas at Arlington
The project strives to complete knowledge graph by soliciting knowledge from the crowd. Currently we focus on missing knowledge detection by leveraging both crowd intelligence and artificial intelligence. Techniques of CrowdSourcing, Collaborative Filtering, and Active Learning are applied to achieve our goal.
The project studies the problem of discovery of long consecutive subsequence consisting of only large (small) values in sequence data, e.g., consecutive games of outstanding performance in sports, consecutive hours of heavy network traffic, and so on. The outcome of this project provides insightful data patterns for data analysis in many real-world applications and is an enabling technique for computational journalism.
South Dakota State University
The project helps to detect breast cancer in early stage, which is the most important stage that can reduce the mortality significantly. We classify breast masses detected in mammograms to tell malignance of the masses. Various image process techniques are applied, for example, Segmentation, Smoothing, Enhancement, and Contour Analysis, etc.
Work experiences
Google Inc.
- Work with Spandex team - SQL support for Spanner
- Developed "Query Reducer": a tool that reduces a complex and lengthy query to its minimal form while retains the issues exhibited in the original query, e.g. reproduces a bug exposed by the query.
NEC Laboratories America, Inc.
- Big data storage improvement - investigating how to store/retrieve enterprise security information efficiently to enable various security applications
- Insider intrusion detection and defense -- defending the enterprise system in-depth
- Incident diagnosis and recovery -- providing root cause analysis of security / performance incidents
The Univeristy of Texas at Arlington
Teaching Assistants of Data Mining, Databases, Intermediate Programming, and other classes.
South Dakota State University
Teaching Assistant of Software Enignieering, Algorithms, and other classes
Revenco Group
Software Engineer Maywide Tech., Guangzhou China 2008 - 2010
- Team Lead of 6 Members
- Elicit and Analyze customer requirements
- Design database conceptual/physical model
- Use C++/Python to implement service billing functionalities
Software Engineer Sunrise Corp., Guangzhou China 2006 - 2008
- Employer of the year (2007, 2008)
- Use C++/Java to implement functionalities for 3 business supporting systems, which serve more than 20 million users.
Publications
Data In, Fact Out: Automated Monitoring of Facts by Factwatcher
N. Hassan, A. Sultana, Y. Wu, G. Zhang#, C. Li, J. Yang, C. Yu.
VLDB'14 The Excellent Demonstration Award
#Contribution: Demonstrates one of the three fact types: Prominent Streak Facts.
Finding, Monitoring, and Checking Claims Computationally Based on Structured Data
The iCheck/uClaim Team (Duke University, University of Texas at Arlington, Google Research)
Computation+Journalism Symposium 2014
Crowdsourcing Pareto-Optimal Object Finding by Pairwise Comparisons
A. Asudeh, G. Zhang, N. Hassan, C. Li, G. Zaruba
arXiv'14
Technical Report
Quality-assured energy balancing for multi-hop wireless multimedia networks via 2-d channel coding rate allocation
L. Xing, W. Wang, G. Zhang, F. Gao, X. Liao, T. Jiang.
RACS'11