-
Data Scientist Intern at Cowen Inc.: Working with Kyber Data Science, LLC (a wholly owned subsidiary of Cowen Inc.) to develop healthcare datasets including EMR, Claims and more for Investment Managers. This role includes the aggregation and cleansing of datasets in the Healthcare pipeline and deriving insights from this data.
-
Graduate Teaching Assistant at NYU Center for Data Science: Graduate Teaching Assistant for the Graduate course DS-GA 1004: Big Data Course taught by Brian McFee, Assistant Professor of Music Technology and Data Science at NYU. Topics include: Relational Databases, Map-reduce, HDFS, Spark, Recommender Systems
-
Data Scientist (Capstone) at GeneCentrix Inc.: Worked on the detection of novel drug-target interactions using NLP across various biomedical text sources for top selling small molecule drugs in the US. Used targeted Bio NLP tools and techniques (scispaCy, BioBERT) to automate the identification of existing drug-target interactions with an AUC of 0.87. Worked in collaboration with GeneCentrix Inc. via NYU Center for Data Science for Capstone for my Fall 2020 semester. This Capstone project contributed to the company’s Drug Profiling Platform.
-
Graduate Research Assistant in Data Science Summer Incubator Program: Worked on non-invertible, privacy preserving representation learning for audio using OpenL3. The pipeline includes extracting features from audio (its melspectrogram) by using OpenL3 and building an inversion model which uses the embeddings obtained from the OpenL3 model to reconstruct the original audio (its melspectrogram). We worked on limiting the reconstruction of audio from its features by adding counter measures such as noise injection to preserve privacy. The aim is for this reconstructed audio to have enough features to be useful for a downstream task but keeping the privacy of its content preserved. Mentor: Brian McFee, Assistant Professor of Music Technology and Data Science at NYU.
-
Hackathon: Participated in a datathon organized by Understood.org and hosted by NYU Center for Data Science. My team and I built a recommendation engine for articles and presented our work at the end of the 8-hour hackathon. This recommendation engine was designed to show relevant content to users based on their previous history, interests and/or similarity with respect to other users. We secured First place at the datathon among 11 teams. Click here to see the repo
-
Data Science Intern at Verzeo: Worked on deploying a Q&A system in EdTech. Used cosine similarity between sentences to find answers to questions from Verzeo’s online course database. Responsible for database management using MySQL Workbench.
-
Data Science Intern at 42hertz INC (acquired by Cisco): Worked on implementing a CLV prediction model using Pareto/NBD model and the Lifetimes library in Python. See post here. Also was part of the end-to-end implementation right from designing the architecture to testing and deployment of a recommender system for Shopify, the E-commerce solutions platform. The recommender system is an app that can be found in the Shopify App Store. Click here to see the app. I worked closely with Jagadeesh Dyaberi, the VP of Engineering and Delivery at 42hertz INC.
-
Research Intern at Indian Institute of Astrophysics: Responsible for analysis of emission lines in the spectral data of a classical nova obtained from the Himalayan Chandra Telescope using Image Reduction and Analysis Facility. I wrote a paper at the end of the internship that summarizes my analysis and findings. Click here to see the paper. Selected by Indian Academy of Science’s Summer Research Fellowship Program and was among 194 selected candidates across India. Selected by Dr. G.C. Anupama, Dean of Indian Institute of Astrophysics and the first woman to serve as President of the Astronomical Society of India.