jegeronimo Last Checkpoint: Mar 01, 2026
Email LinkedIn Handshake GitHub Trusted
Code
Open in...
Python 3 (ipykernel)
Format SQL
about.ipynb
experience.ipynb
organizations.ipynb
courses.ipynb
contact.ipynb
resume.pdf
In [1]:

About

In [2]:
James Geronimo

πŸ‘‹ Hello! I'm James.

In [3]:

πŸ‘‹ I am an undergraduate student at UC Berkeley pursuing Bachelor’s degrees in Computer Science and Data Science.

πŸ’» My technical interests lie in software engineering, AI applications, and CS education. I am passionate about building systems where users can productively interact with AI.

πŸŽ™οΈ Would love to chat! See the Contact tab for my email and social media profiles.

In [4]:

Move your cursor over the terrain to warp the topography.

In [1]:

Experience

In [2]:

Idler (YC S25)

Nov 2025 β€” Present

QA Engineer

Part-Time

Reviewing production-grade coding problems under Widget Factory, the quality-focused division of Idler (YC S25).

  • Ensure production-grade coding failures generated by AI agents are fair, well-specified, and coherent before training
  • Conduct code review to improve reinforcement learning environments that prepare models for production-grade coding
In [3]:

UC Berkeley Data 100

Jun 2024 β€” Present

Lead Teaching Assistant

Part-Time

Teaching 1200+ students in the UC Berkeley's upper-division data science course, Data 100: Principles and Techniques of Data Science.

  • Teach 1200+ students in the upper-division course Data 100, covering data processing, statistical/probabilistic foundations, exploratory data analysis/visualization, feature engineering, dimensionality reduction, predictive modeling, and optimization
  • Streamline course website navigation by developing in Jekyll and automating deployment workflows through GitHub Pages
  • Lead discussion sections to 40+ students, host office hours, debug DataHub issues, and maintain general course infrastructure
In [4]:

Oracle

Sep 2025 β€” Dec 2025

Software Engineer

Contract

Engineered an AI tutor built on Oracle Cloud Infrastructure to promote active learning for UC Berkeley data science students.

  • Create an end-to-end AI tutor on Oracle Cloud Infrastructure with a RAG pipeline built on Oracle Database vectors, delivering source-grounded answers within a 2-second p50 latency SLO for 60+ concurrent users
  • Implement keyword/semantic retrieval using Autonomous AI Databases and vector embeddings for relevance
  • Construct a resilient data layer for 60+ users with usage stats, conversation logs, and insightful dashboards
  • Productionize with CI/CD (GitHub Actions), OCI autoscaling, and observability to sustain 99%+ uptime
In [5]:

Amgen

Jun 2025 β€” Aug 2025

AI Engineer

Internship

Created a text-to-SQL application to democratize access to marketing analytics for Amgen's Commercial Data & Analytics department.

  • Achieve 87%+ accuracy in text-to-SQL operations by building generative AI and testing frameworks in Databricks
  • Optimize Databricks SQL connection and executions by 86%+ in the worst case, reducing operations to 0–80 seconds
  • Deliver a Streamlit application optimized with caching to save 500+ hours and automate 60,000+ queries every year
  • Leverage large language models (LLMs) to automate campaign analytics across 27 brands using distributed AI agents
In [6]:

AmigoAI (SkyDeck)

Dec 2024 β€” May 2025

Software Engineer

Internship

Led independent projects in computer vision/optical character recognition (OCR), webscraping, and automation testing.

  • Devise and build an automated end-to-end testing framework using Playwright to detect regressions and improve test coverage
  • Engineer a robust and dynamic web scraper that mimics real-user behavior by emulating user agents, preloading consent cookies, and bypassing JavaScript-rendered content through direct HTML data extraction using Scrapy, BeautifulSoup, and Selenium
  • Scrape 1000+ data entries into JSON files for fine-tuning a custom-built LLM for standardized exam question generation
  • Construct a cross-format file processing workflow to extract text from files using PyMuPDF/Fitz and Pix2Text
  • Architect a computer vision pipeline that tracks real-time user progress to contextualize query processes in a fine-tuned LLM
In [7]:

UC Berkeley Data Science Modules

Jan 2025 β€” May 2025

Software Developer

Part-Time

Built educational data science modules through UC Berkeley's Data Science Modules Team for El Camino College courses of varying domains.

  • Partner with four El Camino College professors to create educational data science modules teaching probability distributions, data visualization, classification, statistical modeling, and business analytics under the Data Science Modules team
  • Introduce 500+ students every semester to fundamental programming practices, statistics, and data-driven decision-making
In [8]:

UCSF Abbasi Lab

Feb 2025 β€” Apr 2025

Research Assistant

Apprenticeship

Evaluated large language model (LLM) robustness via conversational red-teaming and multi-turn jailbreaking techniques.

  • Evaluate large language models (LLMs) robustness via red-teaming and multi-turn jailbreaking techniques under Abbasi Lab
  • Build an agentic LangChain framework to automate multi-turn LLM conversations and LLM-as-a-judge evaluations with GPT-4o
  • Customize and apply 10+ prompting goals and target strings from the MedSafetyBench dataset for attacker and target LLMs
In [9]:

Universal Health Services

May 2024 β€” Aug 2024

Data Scientist

Contract

Constructed AutoRegressive Integrated Moving Average (ARIMA) and Long Short-Term Memory (LSTM) models for financial forecasting.

  • Achieved <18.5% and <25% average error in financial forecasting by deploying AutoRegressive Integrated Moving Average (ARIMA) and Long Short-Term Memory (LSTM) neural network models, optimized with extensive EDA and fine-tuning
  • Delivered Pyramid Analytics integration, Python script, and walkthrough video; leveraged mentorship from C-Suite executives
In [10]:

UC Berkeley L&S 22

Feb 2024 β€” Aug 2024

Software Developer

Part-Time

Developed data science modules on Simpson's Paradox for UC Berkeley's L&S 22: Sense and Sensibility and Science.

  • Design and construct two data science modules deployed through Jupyter for the undergraduate interdisciplinary course LS 22
  • Introduce 500+ students every semester to fundamental programming practices, statistics, and data-driven decision-making
In [11]:

UW School of Pharmacy

Jan 2024 β€” Apr 2024

Research Assistant

Apprenticeship

Leveraged Natural Language Processing (NLP) to carry out topic modeling and sentiment analysis on textual data.

  • Built and optimized a latent Dirichlet allocation model to cluster textual data, resulting in a coherence score of ~0.684
  • Leveraged natural language processing tools to enable complex analysis of textual data beyond human observation
In [1]:

Organizations

In [2]:

Data Science Society

Sep 2023 β€” Present

Vice President

Jan 2024 β€” Dec 2025

Course Director (Spring 2024), Internal Vice President (Fall 2024), and Project Manager (Fall 2025) for UC Berkeley’s first data science undergraduate student organization.

In [3]:

Berkeley Engineers and Mentors

Sep 2025 β€” Present

Teaching a variety of fun and interactive STEM lessons to Berkeley / Oakland elementary school students.

In [4]:

Kapatid Mentorship Program

Mar 2023 β€” Present

Connecting with my Filipino roots on campus. Shoutout OTP.

In [5]:

Upsilon Pi Epsilon

Feb 2024 β€” Present

Computer Science Honors Society. Top third of Computer Science students.

In [6]:

Computer Science Mentors

Sep 2023 β€” Dec 2024

Senior Mentor

Jan 2024 β€” Dec 2024

Senior Content Mentor (Spring 2024) and Senior Mentor (Fall 2024) specialized in Data Structures and Algorithms (CS 61B).

In [7]:

Open Project

Feb 2023 β€” May 2023

Learned how to code outside of the classroom by attempting to build an NBA playoff predictor.

In [1]:

Courses

In [2]:
cs_courses = ["COMPSCI 370", "COMPSCI 294", "COMPSCI 194-271", "COMPSCI 194-244", "COMPSCI 189", "COMPSCI 188", "COMPSCI 170", "COMPSCI 161", "COMPSCI 70", "COMPSCI 61C", "COMPSCI 61B", "COMPSCI 61A"]
display(cs_courses)
Out[2]:
COMPSCI 370 Instructional Techniques COMPSCI 294 Computer Science Education COMPSCI 194-271 AI Education Research COMPSCI 194-244 CS Education Research COMPSCI 189 Machine Learning COMPSCI 188 Artificial Intelligence COMPSCI 170 Efficient Algorithms COMPSCI 161 Computer Security COMPSCI 70 Discrete Math & Probability COMPSCI 61C Machine Structures COMPSCI 61B Data Structures & Algorithms COMPSCI 61A Computer Programs
In [3]:
data_courses = ["DATA 145", "DATA 144", "DATA 140", "DATA 104", "DATA 102", "DATA 101", "DATA 100", "DATA 8"]
display(data_courses)
Out[3]:
DATA 100 Advanced Data Science DATA 145 Evidence & Uncertainty DATA 144 Data Analytics DATA 140 Probability Theory DATA 104 Data Ethics DATA 102 Inference DATA 101 Data Engineering DATA 8 Data Science
In [4]:
econ_courses = ["ECON 140", "ECON 100A"]
display(econ_courses)
Out[4]:
ECON 140 Econometrics ECON 100A Microeconomics
In [5]:
math_courses = ["MATH 54"]
display(math_courses)
Out[5]:
MATH 54 Linear Algebra
In [1]:

Contact

In [2]:
primary_contacts = { "calendly": "calendly.com/jegeronimo/chat", "email": "jegeronimo@berkeley.edu" }
display(primary_contacts)
Out[2]:

Calendly

@jegeronimo/chat

Email

jegeronimo@berkeley.edu

In [3]:
social_profiles = { "linkedin": "@james-geronimo", "handshake": "@james-geronimo", "github": "@jegeronimo" }
display(social_profiles)
Out[3]:

LinkedIn

@james-geronimo

Handshake

@james-geronimo

GitHub

@jegeronimo

In [4]:
resume = { "name": "James Geronimo", "format": "PDF", "status": "Available" }
display(resume)
Out[4]:

Resume

James Geronimo

Simple 2 main Python 3 (ipykernel) | Idle Mem: 100.30 / 2004.08 MB
Mode: Command Ln 0, Col 0 jegeronimo 0