
Curriculum Vitae

SUMMARY


Self-motivated, proactive and curious-minded Ph.D. student working in the areas of Cloud Computing and Big Data analytics with 3 years of industrial experience in full-stack software design and development.


LinkedIn: anirban-bhattacharjee-76285a19              GitHub: https://github.com/Anirban2404


EDUCATION


  • Vanderbilt University | Nashville, TN, USA                                          Aug 2019 (Expected)
    • Ph.D. in Computer Science
  • West Bengal University of Technology | Kolkata, India                     June 2009
    • Bachelor of Technology in Computer Science and Engineering

TECHNICAL SKILL SETS


  • Programming: Python, Java, NodeJS, PHP, Shell Scripting, C, C++, HTML/CSS, Ansible

  • Database and Data Warehouse Technologies: Oracle, MySQL, Postgres, Microsoft SQL Server, Cassandra, MongoDB, Amazon DynamoDB, Azure CosmosDB, Redshift, SQL Data Warehouse, Informatica

  • Machine Learning/AI Platforms/Libraries: Scikit-learn, TensorFlow, Keras, Weka, Spark MLlib, PyTorch

  • Cloud and Big Data: OpenStack, Azure, AWS, Docker, Kubernetes, Hadoop, Spark, Storm, Kafka, Pig, Hive, ZooKeeper, Raft

  • Development and Design Tools: Eclipse, Visual Studio, Android Studio, Matlab, Jupyter Notebook, PyCharm, Vim, AngularJS, Spring, Informatica, Git/SVN, Maven, LaTeX


RESEARCH EXPERIENCE


Vanderbilt University, Nashville, TN

Graduate Research Assistant | May 2014-Present

  • StreamlinedML: End-to-end Machine Learning and Big Data analytics framework
    • Developed a NodeJS- and MongoDB-based interactive web GUI for building predictive application models, covering data preparation, analysis, visualization, and validation.
    • Designed a Python wrapper to integrate a diverse set of Machine Learning (ML) and statistical analysis algorithms and libraries from various CPU/GPU-intensive frameworks such as TensorFlow, Scikit-learn, and Apache Spark MLlib (Python-based).
    • Coded and tested system modules; developed and leveraged frameworks to improve efficiency.
    • Proactively optimized and scaled resources for dynamic workloads to run deep-learning prediction jobs across heterogeneous distributed platforms.
      • Guaranteed the SLOs 99% of the time.
      • Performed 50-95% better than the naive approach.
  • CloudCAMP: Self-Service Cloud Deployment and Management Platform
    • Developed an extensible self-service framework by abstracting cloud and application infrastructure specifications using Model-Driven Engineering.
    • Developed a NodeJS-based DSML to transform an abstract business model into DevOps-specific infrastructure-as-code for different cloud providers.
  • STRATUM: Server selection framework for deployment and migration
    • Developed Python-based Machine Learning models to predict application performance by collecting system-level metrics using collectd and storing the data in InfluxDB.
    • Built a time-series model for workload prediction using Facebook Prophet’s Python SDK.
    • Migrated and auto-scaled latency-sensitive applications to minimize cost while satisfying performance constraints across the edge/IoT and cloud resources.
  • Evaluation of Security Algorithms on Time-Triggered Networked Control Systems
    • Implemented and evaluated the Hash-based Message Authentication Code (HMAC) protocol in the TTTech device’s Linux real-time kernel.

Graduate Teaching Assistant | Aug 2012-Apr 2014

  • Assisted 50 students in the Programming Languages course (2 semesters)
  • Assisted 50 students in the Cloud Computing course (1 semester)
  • Assisted 20 students in the Compiler Design course (1 semester)

INDUSTRIAL EXPERIENCE


Arm, Austin, TX

Cloud Research Summer Intern | May 2018 – Aug 2018

  • Developed an Engineering Analytics Platform (EAP) for large datasets on the Amazon AWS and Microsoft Azure stacks using the AWS and Azure SDKs for Python.
  • Used stream processing engines (Kinesis, Event Hubs), EMR, Databricks, DynamoDB, CosmosDB, Elasticsearch, Redshift, SQL DW, and visualization tools (Kibana, Power BI, QuickSight) to build the end-to-end EAP.
  • Evaluated the read/write performance of different datasets and data formats residing on several distributed file systems (HDFS, Ceph, and S3) to design a data lake infrastructure.

TATA Consultancy Services, Kolkata, India         

Systems Engineer | Dec 2009 – Aug 2012

  • Developed workflows using PL/SQL, SQL code, and Informatica DW tool to extract, load, transform, and analyze data efficiently.
  • Developed automated software to run the workflows using UNIX Shell Scripting and PL/SQL.
    • Reduced processing time by 30% by introducing parallel query processing and indexing.
    • Contributed code and test cases for system modules to add value to different workflows and reduce their run time.
  • Developed an automated report generation tool that analyzes business data and presents end-user reports in a GUI.
  • Built software for a multi-room hotel using Java MVC, UNIX, and Oracle Database Management System, and implemented multiple user-level views with security.

ACADEMIC PROJECTS


  • A framework for Hadoop Performance Prediction | Apr 2013
    • Benchmarked different Hadoop applications with various configurations for building the performance model.
    • Predicted the cost and performance (using WEKA) of Hadoop-based MapReduce applications in the cloud environment.
  • Android Image Synchronization Application | Dec 2012
    • Downloaded images from a web page and stored them in a Content Provider.
    • Synchronized the state of the Content Providers among multiple devices.
  • Protein-protein interaction prediction using domain-domain interaction with Multilayer Perceptron | Apr 2011
    • Developed a Java-based prediction model to predict interactions between two protein sequences by exploiting all possible combinations of constituent domains.
    • Achieved an overall MLP prediction accuracy of approximately 66.70% using twofold cross-validation.

AWARDS AND ACHIEVEMENTS


  • Received Best Paper awards at the IEEE International Conference on Cloud Engineering (IC2E), 2019, and the International Symposium on Resilient Control Systems (ISRCS), 2014

EXTRACURRICULAR ACTIVITIES


  • Participated in and won awards at corporate and college quiz competitions.
  • Volunteered in social welfare activities.