Curriculum Vitae
SUMMARY
Self-motivated, proactive and curious-minded Ph.D. student working in the areas of Cloud Computing and Big Data analytics with 3 years of industrial experience in full-stack software design and development.
LinkedIn: anirban-bhattacharjee-76285a19 GitHub: https://github.com/Anirban2404
EDUCATION
- Vanderbilt University | Nashville, TN, USA Aug 2019 (Expected)
- Ph.D. in Computer Science
- West Bengal University of Technology | Kolkata, India June 2009
- Bachelor of Technology in Computer Science and Engineering
TECHNICAL SKILL SETS
- Programming: Python, Java, NodeJS, PHP, Shell Scripting, C, C++, HTML/CSS, Ansible
- Database and Data Warehouse Technologies: Oracle, MySQL, Postgres, Microsoft SQL Server, Cassandra, MongoDB, Amazon DynamoDB, Azure CosmosDB, Redshift, SQL Data Warehouse, Informatica
- Machine Learning/AI Platforms/Libraries: Scikit-learn, TensorFlow, Keras, Weka, Spark MLlib, PyTorch
- Cloud and Big Data: OpenStack, Azure, AWS, Docker, Kubernetes, Hadoop, Spark, Storm, Kafka, Pig, Hive, ZooKeeper, Raft
- Development and Design Tools: Eclipse, Visual Studio, Android Studio, MATLAB, Jupyter Notebook, PyCharm, Vim, AngularJS, Spring, Informatica, Git/SVN, Maven, LaTeX
RESEARCH EXPERIENCE
Vanderbilt University, Nashville, TN
Graduate Research Assistant | May 2014-Present
- StreamlinedML: End-to-end Machine Learning and Big Data analytics framework
- Developed an interactive web GUI based on NodeJS and MongoDB for building predictive application models, covering data preparation, analysis, visualization, and validation.
- Designed a Python wrapper to integrate a diverse set of Machine Learning (ML) and statistical analysis algorithms and libraries from various (CPU/GPU intensive) frameworks such as TensorFlow, Scikit-learn, and Apache Spark MLlib (Python-based).
- Coded and tested system modules; developed and leveraged frameworks for efficiency.
- Proactively optimized and scaled resources for dynamic workloads to run deep learning prediction jobs across heterogeneous distributed platforms.
- Guaranteed the SLOs 99% of the time.
- Performed 50-95% better than the naive approach.
- CloudCAMP: Self-Service Cloud Deployment and Management Platform
- Developed an extensible self-service framework by abstracting cloud and application infrastructure specifications using Model-Driven Engineering.
- Developed a NodeJS-based DSML to transform an abstract business model into DevOps-specific infrastructure-as-code for different cloud providers.
- STRATUM: Server selection framework for deployment and migration
- Developed Python-based Machine Learning models to predict application performance by collecting system-level metrics using collectd and storing the data in InfluxDB.
- Built a time series model for workload prediction using Facebook Prophet’s Python SDK.
- Migrated and auto-scaled latency-sensitive applications to minimize cost while satisfying performance constraints across edge/IoT and cloud resources.
- Evaluation of Security Algorithms on Time-Triggered Networked Control Systems
- Implemented and evaluated the Hash-based Message Authentication Code (HMAC) protocol in the TTTech device’s Linux real-time kernel.
Graduate Teaching Assistant | Aug 2012-Apr 2014
- Assisted 50 students in the Programming Languages course (2 semesters)
- Assisted 50 students in the Cloud Computing course (1 semester)
- Assisted 20 students in the Compiler Design course (1 semester)
INDUSTRIAL EXPERIENCE
Arm, Austin, TX
Cloud Research Summer Intern | May 2018 – Aug 2018
- Developed an Engineering Analytics Platform (EAP) for large datasets on the Amazon AWS and Microsoft Azure stacks using the AWS and Azure SDKs for Python.
- Used stream processing engines (Kinesis, Event Hubs), EMR, Databricks, DynamoDB, CosmosDB, Elasticsearch, Redshift, SQL DW, and visualization tools (Kibana, Power BI, QuickSight) to build the end-to-end EAP.
- Evaluated the read/write performance of different datasets and data formats residing on several distributed file systems (HDFS, Ceph, and S3) to design the data lake infrastructure.
TATA Consultancy Services, Kolkata, India
Systems Engineer | Dec 2009 – Aug 2012
- Developed workflows using PL/SQL, SQL, and the Informatica DW tool to extract, load, transform, and analyze data efficiently.
- Developed automated software to run the workflows using UNIX Shell Scripting and PL/SQL.
- Reduced processing time by 30% by introducing parallel query processing and indexing.
- Contributed code and test cases for system modules to add value and reduce the run time of different workflows.
- Developed an automated report generation tool that analyzes business data and presents end-user reports in a GUI.
- Built software for a multi-room hotel using Java MVC, UNIX, and Oracle Database Management System, and implemented multiple user-level views with security.
ACADEMIC PROJECTS
- A framework for Hadoop Performance Prediction | Apr 2013
- Benchmarked different Hadoop applications with various configurations for building the performance model.
- Predicted the cost and performance (using WEKA) of Hadoop-based MapReduce applications in a cloud environment.
- Android Image Synchronization Application | Dec 2012
- Downloaded images from a webpage and stored them in a Content Provider.
- Synchronized the state of the Content Providers among multiple devices.
- Protein-protein interaction prediction using domain-domain interaction with Multilayer Perceptron | Apr 2011
- Developed a Java-based prediction model to predict interactions between two protein sequences by exploiting all possible combinations of constituent domains.
- Achieved an overall MLP prediction accuracy of 66.70% using twofold cross-validation.
AWARDS AND ACHIEVEMENTS
- Received Best Paper awards at the IEEE International Conference on Cloud Engineering (IC2E), 2019, and the International Symposium on Resilient Control Systems (ISRCS), 2014.
EXTRACURRICULAR ACTIVITIES
- Participated in and won awards at quiz competitions at the corporate and college levels.
- Volunteered in social welfare activities.