Get fluent with agentic AI โ then point it at real science.
A short, self-paced ramp for new researchers. By the end you'll understand how agentic AI actually works, be comfortable driving Claude Code, and have a project idea that fits the Roberts Lab's work in marine genomics and aquaculture.
How to use this site
Work top to bottom. Concepts first so the tools make sense, then hands-on practice, then you'll shape a project of your own. Check off boxes as you go โ your progress saves in this browser.
Understand
Modules 1โ2 explain what agentic AI is and the four building blocks you'll hear about all summer.
Practice
Modules 3โ6 get Claude Code running and walk you through real, low-stakes exercises.
Apply
Modules 7โ9 connect everything to the lab and help you draft a project pitch for your mentor.
What is "agentic" AI?
You've probably used a chatbot: you ask, it answers. An agent goes a step further โ it can take actions in a loop to accomplish a goal you give it.
A chatbot
Predicts text. You ask "How do I count reads in a FASTQ file?" and it explains the command. You then run it.
An agent
Can run the command, read the output, notice the file was gzipped, adjust, re-run, and report back โ repeating until the task is done.
The agent loop
Almost every agentic tool works the same way under the hood:
โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ
โ 1. You give a goal โ
โ 2. The model decides on an action (a "tool") โ
โ 3. The tool runs (read a file, run code...) โ
โ 4. The model reads the result โ
โ 5. Done? โ answer. Not done? โ back to 2. โ
โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ
What agents are good (and not good) at
| Great fit | Be careful |
|---|---|
| Writing & debugging scripts, exploring unfamiliar code, reformatting data, drafting documentation, summarizing logs, scaffolding analyses. | Anything where being subtly wrong is costly: statistical interpretation, biological claims, novel methods. You stay the scientist. Verify outputs. |
A useful mental model: the eager new lab member
Treat the agent like a fast, tireless, slightly overconfident junior collaborator. It's read an enormous amount but has never worked in your lab. It will happily do exactly what you ask โ so clear instructions, good context, and checking its work matter just as much as they would with a person.
The four words you'll hear all summer
LLM, agent, skill, MCP. These come up constantly in OPUS-AAI Week 1. Here's what each one actually means.
LLM
Large Language Model. The engine โ a model trained to predict text (Claude, GPT, etc.). On its own it only produces words. Everything "agentic" is built on top.
Agent
An LLM wrapped in the loop from Module 1, plus tools (read files, run commands, search the web). Claude Code is an agent that lives in your terminal.
Skill
A reusable packet of instructions (and sometimes scripts) that teaches an agent how to do a specific task well โ e.g. "run an RNA-seq QC the lab's way." Write it once, reuse it forever.
MCP
Model Context Protocol. A standard way to plug an agent into outside systems โ a database, GitHub, a literature search, Slack. It's how agents reach beyond your laptop.
Set up Claude Code
Claude Code is a terminal-based agent. Don't worry if the terminal is new โ you'll mostly type plain English to it.
Open a terminal
macOS: open Terminal (Cmd+Space, type "terminal"). Windows: install WSL or use the lab's machine. Linux: you already know.
Install Node.js, then Claude Code
Claude Code needs Node.js (v18+). Check, then install:
# check if you already have node
node --version
# install Claude Code (once Node is present)
npm install -g @anthropic-ai/claude-code
Official, always-current install steps: docs.claude.com/en/docs/claude-code.
Point it at the program's endpoint
Your Wednesday session will give you the exact endpoint URL and key. You'll set them as environment variables โ write down what they hand you and keep keys private (never commit them to GitHub).
Launch it inside a project folder
cd ~/my-first-project
claude
You're in. Claude Code can now see the files in that folder and help you work with them.
- Terminal opens and I can type commandsTry
pwdandls node --versionprints a version โฅ 18- Claude Code installed and
claudelaunches - Connected to the program endpoint (mentor confirmed)
Your first session
The whole interface is a conversation. Type what you want in plain English. Here's a gentle on-ramp.
Try these, in order
> What files are in this folder, and what does each one do?
> Make a file called hello.py that prints today's date.
> Run it. If it errors, fix it and run it again.
> Now add a function that counts lines in a text file,
and write a tiny test for it.
Notice what's happening: Claude Code proposes actions, you approve them, it reads the results and keeps going. That's the agent loop from Module 1, live.
Three things to know early
It asks permission
By default it confirms before editing files or running commands. Read what it's about to do before approving.
Context resets
Start a fresh session for a new task. Use a CLAUDE.md file (next module) to give it lasting context.
It can be wrong
Confidently. Always sanity-check results, especially numbers and biology.
- Had a back-and-forth conversation with the agent
- Watched it create, run, and fix a file
- Approved (and declined) at least one proposed action
Starter exercises
Small, realistic tasks that build the muscles you'll use on a real project. Do at least three. Marine-data flavored, but no biology background required.
A. Tidy a messy CSV
Make a small CSV with inconsistent columns (mixed date formats, blank cells, weird headers). Ask the agent to load it, describe the problems, and write a script that cleans it. Then ask it to explain each cleaning step back to you.
B. Explain unfamiliar code
Find a short bioinformatics script online (or a lab repo file). Ask: "Walk me through this line by line, and tell me what could break it." Great for getting up to speed on a new codebase.
C. Plot some real data
Download a small public dataset (e.g. an NCBI sample sheet, or temperature time-series). Ask the agent to load it, summarize it, and make two clear figures. Then ask it to improve the figures for a presentation.
D. Version-control a tiny project
Ask the agent to initialize a git repo for one of the above, write a .gitignore and a short README.md, and explain what each commit is doing. (Week 4 of OPUS-AAI is all about version control โ get a head start.)
E. Write your first skill
Pick something you did twice above (e.g. "clean a CSV the way we like it"). Ask the agent to help you capture it as a reusable SKILL.md with clear steps. Run it on a new file to confirm it works.
- Exercise A โ tidy a CSV
- Exercise B โ explain code
- Exercise C โ plot data
- Exercise D โ version control
- Exercise E โ write a skill (stretch)
Working well with agents
The difference between frustrating and magical is mostly how you set things up. A few habits go a long way.
Give context
Keep a CLAUDE.md in your project describing the goal, the data, the tools, and lab conventions. The agent reads it automatically every session.
Be specific
"Make a plot" โ "Make a line plot of temperature over time, label axes, save as PNG at 300 dpi." Specific in, useful out.
Work in small steps
Ask for one thing, check it, then the next. Easier to catch mistakes than reviewing a giant change at once.
Make it show its work
"Explain your plan before you start." "Why did you choose that?" You learn faster and catch bad assumptions.
Verify everything important
Re-run, spot-check numbers, read the code. The agent is a collaborator, not an oracle.
Protect secrets & data
Never paste API keys or sensitive data. Use .gitignore. Ask your mentor before pushing anything public.
About the lab
You'll spend the summer applying these tools to real questions. The Roberts Lab studies how marine invertebrates โ especially shellfish like oysters โ respond to a changing ocean, using genomics and reproducible computation.
What we study
Environmental physiology and epigenetics of shellfish; how stressors (heat, pH, pathogens, low oxygen) shape gene expression and DNA methylation; and what that means for aquaculture and resilient food systems.
How we work
Heavily computational and open: public lab notebooks, code on GitHub, reproducible pipelines (RNA-seq, WGS, methylation, ATAC-seq), and lots of data wrangling. This is fertile ground for agentic AI.
Project idea starters
Good OPUS-AAI projects pair a real lab need with the agentic methods you just learned โ and produce a playbook others can reuse. Here are seeds. Make them yours, or invent your own.
1 ยท An agentic reproducibility checker
Build a workflow (and skill) that takes one of the lab's analyses and uses an agent to check it reproduces from raw data: are paths right, dependencies recorded, steps documented? Output a "reproducibility report."
2 ยท From messy sample sheet to clean dataset
Many projects start with inconsistent metadata. Create an agent-assisted pipeline that validates and standardizes sample sheets (species, treatment, tissue, accession) and flags problems before analysis.
3 ยท Re-analyze a public shellfish dataset
Pick a public RNA-seq study (GEO/SRA) on oyster or shellfish stress response. Use an agent to help run QC, expression, and a simple differential-expression analysis end to end โ documenting every prompt and decision.
4 ยท Literature-to-table assistant
Build an agent workflow that, given a research question (e.g. "ocean acidification effects on oyster larvae"), gathers papers and extracts a structured comparison table (species, stressor, finding) โ with sources cited and checked.
5 ยท Make a pipeline self-documenting
Take an existing lab script or pipeline and use agents to generate clear docs, a diagram of the workflow, and a README that a brand-new student could follow. (Ties into OPUS-AAI Week 9: workflow figures.)
6 ยท A lab "skills library"
Identify 3โ5 tasks the lab does repeatedly and package each as a tested skill (with the agent's help). Deliver a starter skills library the whole lab can adopt โ a playbook in the most literal sense.
Sizing your idea
A good 10-week project is narrow enough to finish and produce a clear deliverable. Run any idea through these:
| Ask yourself | Why |
|---|---|
| Is there a real lab need behind it? | Keeps you motivated and makes the playbook genuinely useful. |
| Can a first version work in ~2 weeks? | You want something working early, then improve it. |
| Does it exercise agentic methods (not just a script)? | That's the point of the program. |
| Is the data available now? | Waiting on data kills summer projects. |
| Will it produce a clear final artifact + playbook? | Required deliverable; plan for it from day one. |
Draft your pitch
Before you meet your mentor, fill in this one-paragraph pitch. It forces clarity and gives them something concrete to react to. (Practice for the Week 9 non-expert pitch and the Week 10 final presentation.)
The template
PROJECT: <one-line name>
PROBLEM: The lab currently struggles to ______,
which matters because ______.
APPROACH: I'll use agentic AI to ______
(which agent / skills / MCP tools: ______).
DATA: I'll use ______ (already available? yes/no).
WEEK-2 GOAL: A first working version that ______.
DELIVERABLE: A ______ plus a playbook documenting
how others can reuse this approach.
UNKNOWNS: The biggest thing I'm unsure about is ______.
- Skimmed the lab's notebooks / Current Findings / a repo
- Picked or invented a project idea
- Ran it through the sizing questions
- Filled in the pitch template
- Pressure-tested it with the agent
- Ready to discuss with my mentor
The playbook (your big deliverable)
Every team delivers a playbook: a practical guide to using agentic AI in your research area. The good news โ if you take notes as you go, it nearly writes itself.
Capture as you work
Keep a running playbook.md from week one. Each time something works (or fails instructively), jot it down. By Week 10 you'll have a real document, not a panicked weekend.
A solid playbook usually includes
- The problem โ what research task this addresses.
- Setup โ tools, endpoints, and how to get running.
- Workflows โ the step-by-step approaches that worked, with example prompts.
- Skills โ any reusable skills you built, and how to use them.
- Gotchas โ what went wrong, what to watch for, how to verify outputs.
- Results โ what you actually accomplished, with a figure or two.
- Next steps โ what you'd do with more time.
Resources & quick reference
When you're stuck
- Ask the agent itself: "I'm getting this error โ what does it mean?"
- Ask your mentor or a senior student in the group.
- Bring it to the Wednesday standup / midterm review (Week 5).
- Search the docs above before assuming something's broken.
Glossary
| Term | Plain meaning |
|---|---|
| LLM | The text-prediction model that powers everything. |
| Agent | An LLM that can take actions in a loop to reach a goal. |
| Skill | A reusable, packaged instruction set for a task. |
| MCP | A standard plug for connecting agents to outside tools/data. |
| Context | What the agent currently "knows" โ files, history, instructions. |
CLAUDE.md | A project file the agent reads automatically for context. |
| Repo | A version-controlled project folder (git/GitHub). |
| Standup | Quick daily check-in: did / doing / blocked. |
You're ready. Bring your pitch to your mentor, keep notes for the playbook, and remember: you're the scientist โ the agent just helps you move faster. ๐