Resume

YUDONG CAO

2006 Broadway Apt 309, Nashville, TN 37203

 yudong.cao@vanderbilt.edu | (859)583-8733 | www.linkedin.com/in/yudongcao

 

EDUCATION

VANDERBILT UNIVERSITY                                                                                                   Nashville, TN

Master’s Degree in Quantitative Methods | GPA: 3.93                                           Expected May 2017

Bachelor’s Degree in Mathematics, magna cum laude | GPA: 3.83                                             May 2015

  • Honors & Awards: The Peabody Honors Scholarship ($33,500), Phi Beta Kappa, Tutor of the Month
  • Relevant Coursework: Statistical Computing, Regression, Experimental Design, Bayesian Analysis, Exploratory Data Analysis, Factor Analysis, Program Design (Java), Data Structures (Python)
  • Certifications: Machine Learning by Stanford University on Coursera (Jan 2017), Practical Machine Learning by Johns Hopkins University on Coursera (In Progress), SAS Base Programmer (Feb 2017), SAS Statistical Business Analyst (In Progress), SAS Advanced Programmer (In Progress)

 

SKILLS

  • Programming & statistical operation: R, SAS, Python, Java, SQL, Matlab, Mathematica, SPSS
  • Document preparation & text formatting: LaTeX, Git, GitHub, R Markdown, Box, Redcap, Prezi

 

RELEVANT EXPERIENCE

VANDERBILT UNIVERSITY                                                                                                   Nashville, TN

Data Science Research Assistant                                                                          December 2016 – Present

  • Research and implement Natural Language Processing algorithms in Python, performing text and sentimental analyses on text corpus to characterize cross-cultural conceptualization of certain concepts
  • Implement Machine Learning algorithms to train Word2Vec and LDA models to vectorize text data, calculate distances between words, and rank order words in terms of their similarities to the keyword

 

TOTAL-APPS, INC.                                                                                             Orange County, CA

Operations Data Analyst Summer Intern                                                              May 2016 – August 2016

  • Identified and voided potential fraudulent transactions of on average $2,500 daily for a multi-billion-dollar client with logistic regression model, controlling chargebacks and sustaining account reputation
  • Improved the reliability of the credit card transactions risk scoring model by conducting chargeback reserve tracking analysis and developing more efficient queries with auto-fill spreadsheet templates
  • Forecast daily credit card declines and frauds with time series models for 20+ high-risk merchants
  • Provided data management and visualization support, transforming data into information and insights

 

VANDERBILT MEDICAL CENTER                                                                             Nashville, TN

Research Assistant, Data Extractor                                                                   December 2015 – May 2016

  • Developed, organized and managed databases for 5 major clinical trial meta-analysis research projects with quantitative and qualitative data extracted from 1,000+ published research articles worldwide
  • Coded and implemented SQL queries for biostatistical syntheses, transforming former Excel pivot table algorithms into the Access SQL platform and increasing the database management efficiency by >50%

 

PROJECTS

  • Football Player Dollar Values Prediction and Simulation with Data Visualization in R
  • Exploring the Utility of Factor Analysis in Social Media Analytics using R and Mplus
  • Exploratory Data Analysis on Time Use by Principals Assistant Principals with SAS Graphics
  • Bayesian Inference on the Logit Demand Model using R and WinBUGS
  • Path-Independent American Derivatives Pricing Model in Mathematica

Back Home