Jian Xu

Research = Code + Think + Read
Curriculum Vitae Contact me

Who am I?

A curious data miner

who cannot fall asleep upon new findings.

A creative programmer

who loves making tools and optimizing procedures.

A critical thinker

who asks a lot of "why" and "how".

Research Interest

Data mining and network science

Applications

Using network models and data science to solve interdisciplinary problems in complex systems such as financial market (e.g., information diffusion and trading behavior), social network (e.g., online / mobile phone social interactions), and biology (e.g., species invasions via global shipping) in close collaboration with domain experts.

Theory

Network representation of various types of data; in particular, methods and influences of embedding rich information such as higher-order dependencies into networks.

What's new

Jul 5, 2017
Joined Citadel as data scientist
Data Strategies Group
Apr 18 and Jun 20, 2017
Present higher-order network at IoTDI (CPSWeek) and HONS (NetSci)
IoTDI: ACM/IEEE International Conference on Internet-of-Things Design and Implementation, at Pittsburgh, PA
14:30 - 15:00, White River 103, JW Marriot Indianapolis
May 19, 2017
Successful Ph.D. Defense: Representing Big Data as Networks: New Methods and Insights
Dec 7, 2016
Presentation at Northwestern University
Network Science Collaborative Technology Alliance (NS CTA)
For project "Harvesting Social Signals in Multi-genre Networks to Detect Threatening Emergent Phenomena (I4)"
Aug 2015 and Aug 2016
Network Science Collaborative Technology Alliance (NS CTA)
On project "Discovering Network Processes in Time-evolving Networks (C2)"
Jun 24 - 26, 2016
For higher-order network (HON), and the coauthored paper "Structural diversity and homophily: A study across more than one hundred large-scale networks" with Yuxiao Dong, Reid Johnson, and Nitesh Chawla
May 20, 2016
In the top 5% of all research outputs scored by Altmetric, 97th percentile attention score compared to outputs of the same age, featured in 8 news outlets

Professional Experience

2017 - Present
Data Scientist
Data Strategies Group
Aug 2015 and 2016
Research Intern
US Army Research Lab, Adelphi MD, USA
May 2016
Research Intern
Purdue University, West Lafayette, USA
May 2014 - Aug 2014
Research Intern
IBM Research, Dublin, Ireland
Nov 2009 - Jul 2012
Research Assistant
Adaptive Networks and Control Lab
Fudan University, Shanghai, China

Publications

Representing higher-order dependencies in networks

Science Advances

J. Xu, T.L. Wickramarathne, N.V. Chawla

In the top 5% of all research outputs scored by Altmetric, 97th percentile attention score compared to outputs of the same age, featured in 8 news outlets

Improving management of aquatic invasions by integrating shipping network, ecological, and environmental data: data mining for social good

KDD 2014

J. Xu, T.L. Wickramarathne, N.V. Chawla, E.K. Grey, K. Steinhaeuser, R.P. Keller, J.M. Drake, D.M. Lodge

Catching fire: an anatomy of information diffusion using Retweets

Northern Finance Association Conference 2014

Research in Behavioral Finance Conference 2014

10th Annual Central Bank Workshop on the Microstructure of Financial Markets 2014

N.V. Chawla, Z. Da, J. Xu, M. Ye

Human interactive patterns in temporal networks

IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans

Y.Q. Zhang, X. Li, J. Xu, A.V.Vasilakos

Mining Features Associated with Effective Tweets

International Conference on Advances in Social Networks Analysis and Mining (ASONAM) 2017

J. Xu, N.V. Chawla

Structural Diversity and Homophily: A Study Across More Than One Hundred Big Networks

ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD) 2017

J. Xu, N.V. Chawla

Skills

Programming

Python, SQL and Spark for data mining; C# for graphical interface; C for high performance computing

Big Data

Shell scripts for managing parallelized jobs on servers and distributed environments Vertica and Snowflake for billion-scale database queries; Airflow and Linux shell scripts for managing parallelized jobs on Amazon S3 and distributed systems.

Tools

sklearn and Weka for machine learning; Tableau and Gephi for visualization; ArcGIS for geographical information system; NetworkX for network analysis.

Education

2012 - 2017 (expected)
Ph.D. in Computer Science and Engineering
University of Notre Dame, USA
GPA: 4.00/4.00
2008 - 2012
B.Sc. in Electronics Engineering
Fudan University, Shanghai, China
(Top 5 in China)

Honors & Awards

Jun 2017
Advanced Teaching Scholar Certificate
Notre Dame
Nov 2016
Striving for Excellence in Teaching Certificate
Notre Dame
Nov 2015
Outstanding Research Poster Award – Faculty Vote
Notre Dame
May 2013
1st Prize, Schurz Innovation Award on Data Mining
Schurz Communications Inc.
Jun 2012
Outstanding Bachelor Thesis Award
Fudan University
Best in the Department of Electronics Engineering
May 2011
National University Student of the Year in 2010
Chinese Ministry of Education
Only 10 students in China
2011
2nd Prize, National Undergraduate Electronic Design Contest, Shanghai site
As team leader
2010
Excellent Student in Fudan University
Fudan University
Top 5% in the university
2010
Top Ten Students in the School of Information Technology
Out of 1200 students in the school
2010
People's Scholarship
Top 10% in the department. Awarded twice.
2006
1st Prize, National Olympiad in Informatics in Provinces

Hobbies

Flute

Amateur Lv. 9 "Excellent"

Trivia

DIY projects

Instead of using smart speakers like Alexa and subscribe to podcasts, I built my own "radio alarm", which everyday at 8AM, it starts by telling my girlfriend how much I love her (with a different phrase everyday), plays our favorite music, reads the weather forecast, and reads the top financial and tech news from my RSS feeds with a TTS engine.

I also enjoy automating my home, from auto feeders for cats and fish, to the power switches behind my TV.

Higher-order Network
Interdisciplinary paper / Science Advances
HON visualization
Software package / IEEE PacificVis
Aquatic invasion
data mining
Data mining paper / KDD 2014
Tweet diffusion
Finance paper / Under review
Temporal motifs
Networks paper / IEEE SMCA
Effective Tweeting
Paper / Software
Leadership &
media coverage
Media coverage
Lecture Room 5023
Video series

Contact me



Curriculum Vitae

Let's keep in touch!