Category: Data_Science_Masters

  • Mpox Viral Sentiments: LLM and BERt-based Approaches to Sentiment Analysis

    Project Overview:  This project explored use of social media and news data (Meta, X, Reddit, Google News) and techniques to identify the changing social sentiment regarding Mpox, a viral disease that emerged as a global epidemic in 2022, with North American focus and again in summer 2024, within Africa.  The team partnered with Dr. Bouchra…

    Continue reading →

  • Data Science RAG Model

    Retrieveal Augmented Generation Model using LangChain, Weaviate, and OpenAI ChatGPT Overview: This project leverages a RAG, or Retrieval Augmented Generation, model and incorporates a large language model to retrieve answers to questions only from specified materials. The recommended readings, lecture notes, and syllabus from CS89B Natural Language Processing have been used to test RAG model…

    Continue reading →

  • Learning the Basics: Neural Networks for Bird Species Prediction

    Project Overview:A hallmark of effective prediction applications is access to a large, high-quality dataset that captures the underlying dynamics and variables. Bird enthusiasts have been recording bird songs for decades, creating deep repositories of labeled audio data. These resources enable the development of models that can match new audio recordings to specific bird species. This…

    Continue reading →

  • Pharmacy Interoperability Clinical DataWarehouse

    Project Overview:  This project was done for a databases class as a demonstration project.  It was a precursor to my getting involved with the Sequoia Foundation’s Pharmacy Interoperability task force. Approach:  Python Code was used to parse a FHIR json patient data sample, and then autogenerate SQL database insert statements. A MySQL demonstration database was…

    Continue reading →

  • Chemical Diffusion Model Exploration

    The APM 115 course is geared as a pre-thesis class for Applied Math majors and I took it to practice the math that I’d forgotten from my undergraduate chemical engineering degree. It was one of my favorite classes in the program and reminded me of why I had liked math enough to pursue a degree…

    Continue reading →

  • Survival Analytics Approaches for Adherence

    Overview: Understanding the drivers behind medication adherence helps pharmacies, payors and drug manufacturers develop interventions that may improve patient adherence.  Diabetes and statin medications, in particular, help treat chronic conditions that require daily medications and are interesting examples with large patient populations.  The goal of this study is to examine how survival analytics methodologies may…

    Continue reading →