In [1]:

About

In [2]:

👋 Hello! I'm James.

In [3]:

👋 I am an undergraduate student at UC Berkeley pursuing Bachelor’s degrees in Computer Science and Data Science.

💻 My technical interests lie in software engineering, AI applications, and CS education. I am passionate about building systems where users can productively interact with AI.

🎙️ Would love to chat! See the Contact tab for my email and social media profiles.

In [4]:

Move your cursor over the terrain to warp the topography.

In [1]:

Experience

In [2]:

Idler (YC S25)

Nov 2025 — Present

QA Engineer

Part-Time

Reviewing production-grade coding problems under Widget Factory, the quality-focused division of Idler (YC S25).

Ensure production-grade coding failures generated by AI agents are fair, well-specified, and coherent before training
Conduct code review to improve reinforcement learning environments that prepare models for production-grade coding

In [3]:

UC Berkeley Data 100

Jun 2024 — Present

Lead Teaching Assistant

Part-Time

Teaching 1200+ students in the UC Berkeley's upper-division data science course, Data 100: Principles and Techniques of Data Science.

Teach 1200+ students in the upper-division course Data 100, covering data processing, statistical/probabilistic foundations, exploratory data analysis/visualization, feature engineering, dimensionality reduction, predictive modeling, and optimization
Streamline course website navigation by developing in Jekyll and automating deployment workflows through GitHub Pages
Lead discussion sections to 40+ students, host office hours, debug DataHub issues, and maintain general course infrastructure

In [4]:

Oracle

Sep 2025 — Dec 2025

Software Engineer

Contract

Engineered an AI tutor built on Oracle Cloud Infrastructure to promote active learning for UC Berkeley data science students.

Create an end-to-end AI tutor on Oracle Cloud Infrastructure with a RAG pipeline built on Oracle Database vectors, delivering source-grounded answers within a 2-second p50 latency SLO for 60+ concurrent users
Implement keyword/semantic retrieval using Autonomous AI Databases and vector embeddings for relevance
Construct a resilient data layer for 60+ users with usage stats, conversation logs, and insightful dashboards
Productionize with CI/CD (GitHub Actions), OCI autoscaling, and observability to sustain 99%+ uptime

In [5]:

Amgen

Jun 2025 — Aug 2025

AI Engineer

Internship

Created a text-to-SQL application to democratize access to marketing analytics for Amgen's Commercial Data & Analytics department.

Achieve 87%+ accuracy in text-to-SQL operations by building generative AI and testing frameworks in Databricks
Optimize Databricks SQL connection and executions by 86%+ in the worst case, reducing operations to 0–80 seconds
Deliver a Streamlit application optimized with caching to save 500+ hours and automate 60,000+ queries every year
Leverage large language models (LLMs) to automate campaign analytics across 27 brands using distributed AI agents

In [6]:

AmigoAI (SkyDeck)

Dec 2024 — May 2025

Software Engineer

Internship

Led independent projects in computer vision/optical character recognition (OCR), webscraping, and automation testing.

Devise and build an automated end-to-end testing framework using Playwright to detect regressions and improve test coverage
Engineer a robust and dynamic web scraper that mimics real-user behavior by emulating user agents, preloading consent cookies, and bypassing JavaScript-rendered content through direct HTML data extraction using Scrapy, BeautifulSoup, and Selenium
Scrape 1000+ data entries into JSON files for fine-tuning a custom-built LLM for standardized exam question generation
Construct a cross-format file processing workflow to extract text from files using PyMuPDF/Fitz and Pix2Text
Architect a computer vision pipeline that tracks real-time user progress to contextualize query processes in a fine-tuned LLM

In [7]:

UC Berkeley Data Science Modules

Jan 2025 — May 2025

Software Developer

Part-Time

Built educational data science modules through UC Berkeley's Data Science Modules Team for El Camino College courses of varying domains.

Partner with four El Camino College professors to create educational data science modules teaching probability distributions, data visualization, classification, statistical modeling, and business analytics under the Data Science Modules team
Introduce 500+ students every semester to fundamental programming practices, statistics, and data-driven decision-making

In [8]:

UCSF Abbasi Lab

Feb 2025 — Apr 2025

Research Assistant

Apprenticeship

Evaluated large language model (LLM) robustness via conversational red-teaming and multi-turn jailbreaking techniques.

Evaluate large language models (LLMs) robustness via red-teaming and multi-turn jailbreaking techniques under Abbasi Lab
Build an agentic LangChain framework to automate multi-turn LLM conversations and LLM-as-a-judge evaluations with GPT-4o
Customize and apply 10+ prompting goals and target strings from the MedSafetyBench dataset for attacker and target LLMs

In [9]:

Universal Health Services

May 2024 — Aug 2024

Data Scientist

Contract

Constructed AutoRegressive Integrated Moving Average (ARIMA) and Long Short-Term Memory (LSTM) models for financial forecasting.

Achieved <18.5% and <25% average error in financial forecasting by deploying AutoRegressive Integrated Moving Average (ARIMA) and Long Short-Term Memory (LSTM) neural network models, optimized with extensive EDA and fine-tuning
Delivered Pyramid Analytics integration, Python script, and walkthrough video; leveraged mentorship from C-Suite executives

In [10]:

UC Berkeley L&S 22

Feb 2024 — Aug 2024

Software Developer

Part-Time

Developed data science modules on Simpson's Paradox for UC Berkeley's L&S 22: Sense and Sensibility and Science.

Design and construct two data science modules deployed through Jupyter for the undergraduate interdisciplinary course LS 22
Introduce 500+ students every semester to fundamental programming practices, statistics, and data-driven decision-making

In [11]:

UW School of Pharmacy

Jan 2024 — Apr 2024

Research Assistant

Apprenticeship

Leveraged Natural Language Processing (NLP) to carry out topic modeling and sentiment analysis on textual data.

Built and optimized a latent Dirichlet allocation model to cluster textual data, resulting in a coherence score of ~0.684
Leveraged natural language processing tools to enable complex analysis of textual data beyond human observation

In [1]:

Organizations

In [2]:

Data Science Society

Sep 2023 — Present

Vice President

Jan 2024 — Dec 2025

Course Director (Spring 2024), Internal Vice President (Fall 2024), and Project Manager (Fall 2025) for UC Berkeley’s first data science undergraduate student organization.

In [3]:

Berkeley Engineers and Mentors

Sep 2025 — Present

Teaching a variety of fun and interactive STEM lessons to Berkeley / Oakland elementary school students.

In [4]:

Kapatid Mentorship Program

Mar 2023 — Present

Connecting with my Filipino roots on campus. Shoutout OTP.

In [5]:

Upsilon Pi Epsilon

Feb 2024 — Present

Computer Science Honors Society. Top third of Computer Science students.

In [6]:

Computer Science Mentors

Sep 2023 — Dec 2024

Senior Mentor

Jan 2024 — Dec 2024

Senior Content Mentor (Spring 2024) and Senior Mentor (Fall 2024) specialized in Data Structures and Algorithms (CS 61B).

In [7]:

Open Project

Feb 2023 — May 2023

Learned how to code outside of the classroom by attempting to build an NBA playoff predictor.

In [1]:

Courses

In [2]:

cs_courses = ["COMPSCI 370", "COMPSCI 294", "COMPSCI 194-271", "COMPSCI 194-244", "COMPSCI 189", "COMPSCI 188", "COMPSCI 170", "COMPSCI 161", "COMPSCI 70", "COMPSCI 61C", "COMPSCI 61B", "COMPSCI 61A"]
display(cs_courses)

Out[2]:

COMPSCI 370 Instructional Techniques COMPSCI 294 Computer Science Education COMPSCI 194-271 AI Education Research COMPSCI 194-244 CS Education Research COMPSCI 189 Machine Learning COMPSCI 188 Artificial Intelligence COMPSCI 170 Efficient Algorithms COMPSCI 161 Computer Security COMPSCI 70 Discrete Math & Probability COMPSCI 61C Machine Structures COMPSCI 61B Data Structures & Algorithms COMPSCI 61A Computer Programs

In [3]:

data_courses = ["DATA 145", "DATA 144", "DATA 140", "DATA 104", "DATA 102", "DATA 101", "DATA 100", "DATA 8"]
display(data_courses)

Out[3]:

DATA 100 Advanced Data Science DATA 145 Evidence & Uncertainty DATA 144 Data Analytics DATA 140 Probability Theory DATA 104 Data Ethics DATA 102 Inference DATA 101 Data Engineering DATA 8 Data Science

In [4]:

econ_courses = ["ECON 140", "ECON 100A"]
display(econ_courses)

Out[4]:

ECON 140 Econometrics ECON 100A Microeconomics

In [5]:

math_courses = ["MATH 54"]
display(math_courses)

Out[5]:

MATH 54 Linear Algebra

In [1]:

Contact

In [2]:

primary_contacts = { "calendly": "calendly.com/jegeronimo/chat", "email": "jegeronimo@berkeley.edu" }
display(primary_contacts)

Out[2]:

Calendly

@jegeronimo/chat

jegeronimo@berkeley.edu

In [3]:

social_profiles = { "linkedin": "@james-geronimo", "handshake": "@james-geronimo", "github": "@jegeronimo" }
display(social_profiles)

Out[3]:

@james-geronimo

Handshake

@james-geronimo

GitHub

@jegeronimo

In [4]:

resume = { "name": "James Geronimo", "format": "PDF", "status": "Available" }
display(resume)

Out[4]:

Resume

James Geronimo