OPUS-AAI 2026 ยท 10-week summer program

Get fluent with agentic AI โ€” then point it at real science.

A short, self-paced ramp for new researchers. By the end you'll understand how agentic AI actually works, be comfortable driving Claude Code, and have a project idea that fits the Roberts Lab's work in marine genomics and aquaculture.

โฑ๏ธ ~4โ€“6 hours, self-paced ๐Ÿงฌ Marine genomics framing ๐Ÿ’ป Claude Code focused โœ… No prior AI experience needed
Start here

How to use this site

Work top to bottom. Concepts first so the tools make sense, then hands-on practice, then you'll shape a project of your own. Check off boxes as you go โ€” your progress saves in this browser.

๐Ÿ“–

Understand

Modules 1โ€“2 explain what agentic AI is and the four building blocks you'll hear about all summer.

๐Ÿ› ๏ธ

Practice

Modules 3โ€“6 get Claude Code running and walk you through real, low-stakes exercises.

๐Ÿ”ฌ

Apply

Modules 7โ€“9 connect everything to the lab and help you draft a project pitch for your mentor.

What you'll deliver this summer. Each team produces a playbook โ€” a practical guide to using agentic AI in your research area. Keep notes from day one; module 10 shows you how the pieces add up.
Module 1

What is "agentic" AI?

You've probably used a chatbot: you ask, it answers. An agent goes a step further โ€” it can take actions in a loop to accomplish a goal you give it.

A chatbot

Predicts text. You ask "How do I count reads in a FASTQ file?" and it explains the command. You then run it.

An agent

Can run the command, read the output, notice the file was gzipped, adjust, re-run, and report back โ€” repeating until the task is done.

The agent loop

Almost every agentic tool works the same way under the hood:

   โ”Œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”
   โ”‚  1. You give a goal                           โ”‚
   โ”‚  2. The model decides on an action (a "tool") โ”‚
   โ”‚  3. The tool runs (read a file, run code...)  โ”‚
   โ”‚  4. The model reads the result                โ”‚
   โ”‚  5. Done? โ†’ answer.  Not done? โ†’ back to 2.   โ”‚
   โ””โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”˜
Why this matters for research. A lot of science is exactly this loop: try something, look at the result, adjust. Agents are useful precisely where there's iteration โ€” debugging a pipeline, wrangling messy data, exploring a new dataset โ€” not just where there's a single right answer.

What agents are good (and not good) at

Great fitBe careful
Writing & debugging scripts, exploring unfamiliar code, reformatting data, drafting documentation, summarizing logs, scaffolding analyses. Anything where being subtly wrong is costly: statistical interpretation, biological claims, novel methods. You stay the scientist. Verify outputs.
A useful mental model: the eager new lab member

Treat the agent like a fast, tireless, slightly overconfident junior collaborator. It's read an enormous amount but has never worked in your lab. It will happily do exactly what you ask โ€” so clear instructions, good context, and checking its work matter just as much as they would with a person.

Module 2

The four words you'll hear all summer

LLM, agent, skill, MCP. These come up constantly in OPUS-AAI Week 1. Here's what each one actually means.

๐Ÿง 

LLM

Large Language Model. The engine โ€” a model trained to predict text (Claude, GPT, etc.). On its own it only produces words. Everything "agentic" is built on top.

The program provides OpenAI-compatible LLM endpoints you'll plug into your tools at no cost.

๐Ÿค–

Agent

An LLM wrapped in the loop from Module 1, plus tools (read files, run commands, search the web). Claude Code is an agent that lives in your terminal.

๐Ÿ“‹

Skill

A reusable packet of instructions (and sometimes scripts) that teaches an agent how to do a specific task well โ€” e.g. "run an RNA-seq QC the lab's way." Write it once, reuse it forever.

๐Ÿ”Œ

MCP

Model Context Protocol. A standard way to plug an agent into outside systems โ€” a database, GitHub, a literature search, Slack. It's how agents reach beyond your laptop.

How they stack: an LLM is the engine โ†’ an agent adds the loop and tools โ†’ skills teach it your workflows โ†’ MCP connects it to your data and services. You'll touch all four this summer.
Module 3 ยท Hands-on

Set up Claude Code

Claude Code is a terminal-based agent. Don't worry if the terminal is new โ€” you'll mostly type plain English to it.

Before you start: the program provides LLM endpoints and access. If you hit an account, billing, or login wall, stop and ask your mentor or a Wednesday session lead โ€” don't pay for anything yourself.
1

Open a terminal

macOS: open Terminal (Cmd+Space, type "terminal"). Windows: install WSL or use the lab's machine. Linux: you already know.

2

Install Node.js, then Claude Code

Claude Code needs Node.js (v18+). Check, then install:

# check if you already have node
node --version

# install Claude Code (once Node is present)
npm install -g @anthropic-ai/claude-code

Official, always-current install steps: docs.claude.com/en/docs/claude-code.

3

Point it at the program's endpoint

Your Wednesday session will give you the exact endpoint URL and key. You'll set them as environment variables โ€” write down what they hand you and keep keys private (never commit them to GitHub).

4

Launch it inside a project folder

cd ~/my-first-project
claude

You're in. Claude Code can now see the files in that folder and help you work with them.

  • Terminal opens and I can type commandsTry pwd and ls
  • node --version prints a version โ‰ฅ 18
  • Claude Code installed and claude launches
  • Connected to the program endpoint (mentor confirmed)
Module 4 ยท Hands-on

Your first session

The whole interface is a conversation. Type what you want in plain English. Here's a gentle on-ramp.

Try these, in order

> What files are in this folder, and what does each one do?

> Make a file called hello.py that prints today's date.

> Run it. If it errors, fix it and run it again.

> Now add a function that counts lines in a text file,
  and write a tiny test for it.

Notice what's happening: Claude Code proposes actions, you approve them, it reads the results and keeps going. That's the agent loop from Module 1, live.

Three things to know early

It asks permission

By default it confirms before editing files or running commands. Read what it's about to do before approving.

Context resets

Start a fresh session for a new task. Use a CLAUDE.md file (next module) to give it lasting context.

It can be wrong

Confidently. Always sanity-check results, especially numbers and biology.

  • Had a back-and-forth conversation with the agent
  • Watched it create, run, and fix a file
  • Approved (and declined) at least one proposed action
Module 5 ยท Hands-on

Starter exercises

Small, realistic tasks that build the muscles you'll use on a real project. Do at least three. Marine-data flavored, but no biology background required.

Easy

A. Tidy a messy CSV

Make a small CSV with inconsistent columns (mixed date formats, blank cells, weird headers). Ask the agent to load it, describe the problems, and write a script that cleans it. Then ask it to explain each cleaning step back to you.

You'll learn: giving the agent a file, reviewing its plan, reading generated code.

Easy

B. Explain unfamiliar code

Find a short bioinformatics script online (or a lab repo file). Ask: "Walk me through this line by line, and tell me what could break it." Great for getting up to speed on a new codebase.

You'll learn: using agents to understand, not just write.

Medium

C. Plot some real data

Download a small public dataset (e.g. an NCBI sample sheet, or temperature time-series). Ask the agent to load it, summarize it, and make two clear figures. Then ask it to improve the figures for a presentation.

You'll learn: the explore โ†’ summarize โ†’ visualize loop you'll repeat all summer.

Medium

D. Version-control a tiny project

Ask the agent to initialize a git repo for one of the above, write a .gitignore and a short README.md, and explain what each commit is doing. (Week 4 of OPUS-AAI is all about version control โ€” get a head start.)

You'll learn: git basics with a guide riding along.

Stretch

E. Write your first skill

Pick something you did twice above (e.g. "clean a CSV the way we like it"). Ask the agent to help you capture it as a reusable SKILL.md with clear steps. Run it on a new file to confirm it works.

You'll learn: turning a one-off into a reusable workflow โ€” the heart of the playbook deliverable.

  • Exercise A โ€” tidy a CSV
  • Exercise B โ€” explain code
  • Exercise C โ€” plot data
  • Exercise D โ€” version control
  • Exercise E โ€” write a skill (stretch)
Module 6

Working well with agents

The difference between frustrating and magical is mostly how you set things up. A few habits go a long way.

Give context

Keep a CLAUDE.md in your project describing the goal, the data, the tools, and lab conventions. The agent reads it automatically every session.

Be specific

"Make a plot" โ†’ "Make a line plot of temperature over time, label axes, save as PNG at 300 dpi." Specific in, useful out.

Work in small steps

Ask for one thing, check it, then the next. Easier to catch mistakes than reviewing a giant change at once.

Make it show its work

"Explain your plan before you start." "Why did you choose that?" You learn faster and catch bad assumptions.

Verify everything important

Re-run, spot-check numbers, read the code. The agent is a collaborator, not an oracle.

Protect secrets & data

Never paste API keys or sensitive data. Use .gitignore. Ask your mentor before pushing anything public.

The standup habit. OPUS-AAI runs weekly standups (Weeks 6 & 8). Each day, jot: what I did, what I'm doing next, what's blocking me. Ask the agent to help summarize your week's work into a 3-line standup โ€” practice for the real thing.
Module 7 ยท Your project

About the lab

You'll spend the summer applying these tools to real questions. The Roberts Lab studies how marine invertebrates โ€” especially shellfish like oysters โ€” respond to a changing ocean, using genomics and reproducible computation.

What we study

Environmental physiology and epigenetics of shellfish; how stressors (heat, pH, pathogens, low oxygen) shape gene expression and DNA methylation; and what that means for aquaculture and resilient food systems.

How we work

Heavily computational and open: public lab notebooks, code on GitHub, reproducible pipelines (RNA-seq, WGS, methylation, ATAC-seq), and lots of data wrangling. This is fertile ground for agentic AI.

Before module 8, skim the lab's public materials so your project ideas land: the lab notebooks and the Current Findings site, plus a few recent repos on GitHub. Ask your mentor for the current links โ€” and note that "make our work easier to reproduce and share" is itself a great project theme.
Module 8 ยท Your project

Project idea starters

Good OPUS-AAI projects pair a real lab need with the agentic methods you just learned โ€” and produce a playbook others can reuse. Here are seeds. Make them yours, or invent your own.

1 ยท An agentic reproducibility checker

Build a workflow (and skill) that takes one of the lab's analyses and uses an agent to check it reproduces from raw data: are paths right, dependencies recorded, steps documented? Output a "reproducibility report."

AI angle: agent + skills + git ยท Lab value: open, reproducible science

2 ยท From messy sample sheet to clean dataset

Many projects start with inconsistent metadata. Create an agent-assisted pipeline that validates and standardizes sample sheets (species, treatment, tissue, accession) and flags problems before analysis.

AI angle: data wrangling + skill authoring ยท Lab value: cleaner downstream analyses

3 ยท Re-analyze a public shellfish dataset

Pick a public RNA-seq study (GEO/SRA) on oyster or shellfish stress response. Use an agent to help run QC, expression, and a simple differential-expression analysis end to end โ€” documenting every prompt and decision.

AI angle: agent-driven bioinformatics ยท Lab value: new look at existing data

4 ยท Literature-to-table assistant

Build an agent workflow that, given a research question (e.g. "ocean acidification effects on oyster larvae"), gathers papers and extracts a structured comparison table (species, stressor, finding) โ€” with sources cited and checked.

AI angle: MCP / search tools + verification ยท Lab value: faster literature synthesis

5 ยท Make a pipeline self-documenting

Take an existing lab script or pipeline and use agents to generate clear docs, a diagram of the workflow, and a README that a brand-new student could follow. (Ties into OPUS-AAI Week 9: workflow figures.)

AI angle: agents for communication ยท Lab value: onboarding & clarity

6 ยท A lab "skills library"

Identify 3โ€“5 tasks the lab does repeatedly and package each as a tested skill (with the agent's help). Deliver a starter skills library the whole lab can adopt โ€” a playbook in the most literal sense.

AI angle: skill engineering ยท Lab value: compounding time savings

Sizing your idea

A good 10-week project is narrow enough to finish and produce a clear deliverable. Run any idea through these:

Ask yourselfWhy
Is there a real lab need behind it?Keeps you motivated and makes the playbook genuinely useful.
Can a first version work in ~2 weeks?You want something working early, then improve it.
Does it exercise agentic methods (not just a script)?That's the point of the program.
Is the data available now?Waiting on data kills summer projects.
Will it produce a clear final artifact + playbook?Required deliverable; plan for it from day one.
Module 9 ยท Your project

Draft your pitch

Before you meet your mentor, fill in this one-paragraph pitch. It forces clarity and gives them something concrete to react to. (Practice for the Week 9 non-expert pitch and the Week 10 final presentation.)

The template

PROJECT:   <one-line name>

PROBLEM:   The lab currently struggles to ______,
           which matters because ______.

APPROACH:  I'll use agentic AI to ______
           (which agent / skills / MCP tools: ______).

DATA:      I'll use ______ (already available? yes/no).

WEEK-2 GOAL:  A first working version that ______.

DELIVERABLE: A ______ plus a playbook documenting
             how others can reuse this approach.

UNKNOWNS:  The biggest thing I'm unsure about is ______.
Use the agent to pressure-test it. Paste your draft and ask: "Poke holes in this 10-week project plan. What's too big? What am I assuming? What could I cut?" Bring the strongest version to your mentor.
  • Skimmed the lab's notebooks / Current Findings / a repo
  • Picked or invented a project idea
  • Ran it through the sizing questions
  • Filled in the pitch template
  • Pressure-tested it with the agent
  • Ready to discuss with my mentor
Module 10 ยท Wrap

The playbook (your big deliverable)

Every team delivers a playbook: a practical guide to using agentic AI in your research area. The good news โ€” if you take notes as you go, it nearly writes itself.

Capture as you work

Keep a running playbook.md from week one. Each time something works (or fails instructively), jot it down. By Week 10 you'll have a real document, not a panicked weekend.

A solid playbook usually includes

  • The problem โ€” what research task this addresses.
  • Setup โ€” tools, endpoints, and how to get running.
  • Workflows โ€” the step-by-step approaches that worked, with example prompts.
  • Skills โ€” any reusable skills you built, and how to use them.
  • Gotchas โ€” what went wrong, what to watch for, how to verify outputs.
  • Results โ€” what you actually accomplished, with a figure or two.
  • Next steps โ€” what you'd do with more time.
Meta move: use the agent to draft your playbook from your notes and commit history. You'll be using agentic AI to document how you used agentic AI โ€” exactly the skill the program is building.
Reference

Resources & quick reference

When you're stuck

  • Ask the agent itself: "I'm getting this error โ€” what does it mean?"
  • Ask your mentor or a senior student in the group.
  • Bring it to the Wednesday standup / midterm review (Week 5).
  • Search the docs above before assuming something's broken.

Glossary

TermPlain meaning
LLMThe text-prediction model that powers everything.
AgentAn LLM that can take actions in a loop to reach a goal.
SkillA reusable, packaged instruction set for a task.
MCPA standard plug for connecting agents to outside tools/data.
ContextWhat the agent currently "knows" โ€” files, history, instructions.
CLAUDE.mdA project file the agent reads automatically for context.
RepoA version-controlled project folder (git/GitHub).
StandupQuick daily check-in: did / doing / blocked.

You're ready. Bring your pitch to your mentor, keep notes for the playbook, and remember: you're the scientist โ€” the agent just helps you move faster. ๐ŸŒŠ