From Pair Programming to Scientific Discovery:
AI as Research Partner

BIO2 Symposium Keynote

Graham W. Taylor

University of Guelph / Vector Institute

August 7, 2025

Talk Overview

Part 1: AI for Biodiversity Research

  • Our team’s work
  • Connection to bioinformatics and biotechnology

Part 2: The Changing Landscape

  • AI agents in software development
  • Preparing for the workforce
  • Practical integration strategies

AI for Biodiversity ↔︎ Biodiversity for AI

AI → Biodiversity

  • Earth faces unprecedented biodiversity crisis
  • <10% of ~20M species scientifically named
  • Explosion of multimodal data:
    • 🧬 DNA barcoding
    • 📸 Imaging
    • 🎤 Bioacoustics
  • Shift from lab to “in-the-wild”
  • Foundation models across modalities

Biodiversity → AI

  • Complex, hierarchical data structures
  • Long-tailed species distribution
  • Fine-grained recognition challenges
  • Out-of-distribution performance
  • Continual learning requirements

Key Statistics

  • 5 million specimens
  • Images + DNA barcodes + taxonomic labels
  • Challenge: Incomplete taxonomic annotations
    • Many labels only to family/genus level
    • Expert annotation is difficult

Three-Way Contrastive Learning: CLIBD

CLIBD: learning and inference

Key Innovation

  • Three-way contrastive learning
  • Vision ↔︎ Text encodings of taxonomic labels ↔︎ DNA
  • Cross-modal retrieval

Advantage

Train with DNA, deploy with images only

Image courtesy of Gong et al. 2025

DNA-Supervised Vision: Cost-Effective Deployment

Retrieval examples showing cost-effective deployment

Cost Comparison

  • DNA sequencing: $$$
  • Image capture: $

Key Insights

  • DNA gives fine-grained supervision to vision
  • Train expensive, deploy cheap

Image courtesy of Gong et al. 2025

Taxonomic RAG: Beyond Direct Classification

Example output from Module I (light blue) and Module II (light green) for an image of a yellow garden spider (Photo attribution: David Illig).

Highlight

  • With Nate Lesperance (MBINF alumnus)

Image courtesy of Lesperance et al. 2025

Handling Rare Species Through Retrieval

Taxonomic RAG Architecture: RAG-based approach for handling rare species

Note

Traditional (Vision-Only) Approach

  • Great for common species
  • Fails on rare taxa
  • No uncertainty awareness

RAG Approach

  • Handles rare species via text matching
  • Leverages historical text-based records
  • Knows its confidence level

Image courtesy of Lesperance et al. 2025

From Research to Career Questions

“How should I be safely integrating AI into my workflows?”

Why You Shouldn’t Dismiss AI

The pace of change is unprecedented

And it’s accelerating

📈

The METR Study: A New Moore’s Law for AI

METR Study showing doubling time of AI task completion capabilities

Key Finding

“The length of tasks AI can do is doubling every 7 months” - METR Study

Timeline Evolution

  • GPT-4 era: Seconds to minutes of autonomy
  • Current models: Minutes to hours of autonomy
  • Trend: Exponential capability growth

Image courtesy of METR 2025

Gradual Disempowerment: A Different Kind of Risk

Three areas where humans may be displaced by AI

“We might all find ourselves struggling to hold on to money, influence, even relevance. This new world could be more friendly and humane in many ways, while it lasts… But humans would be a drag on growth.”

David Duvenaud, The Guardian, “Better at everything: how AI could make human beings irrelevant”

The Competitive Reality

Organizations with Humans

  • Speed: Slower
  • Cost: 2x more expensive
  • Reliability: Variable
  • Outcome: Outcompeted

Organizations with AI

  • Speed: Faster
  • Cost: 50% cheaper
  • Reliability: Consistent
  • Outcome: Market dominance

Organizations that don’t adapt to AI will be displaced by those that do

The Solution: Strategic Integration

Avoiding AI accelerates displacement

Three pillars for thriving in the AI era:

  • 🤝 Learn to work WITH AI
  • 🧠 Maintain human capabilities
  • 🚀 Stay competitive & relevant

Skills Being Automated

🔍 Web Search & Literature Review

  • LLM-powered search replacing Google
  • Synthesis in seconds vs hours
  • Multiple sources browsed automatically

💻 Software Development

  • GitHub Copilot, Cursor, Claude Code
  • AI agent: 660 commits, top contributor
  • From autocomplete to autonomous development

💡 Data Analysis & Ideation

  • Creative tasks being automated
  • Research ideation affected
  • No longer “human-only” territory

Web Search Revolution

“I can feel my usage of Google search taking a nosedive already. I expect a bumpy ride as a new economic model for the Web lurches into view.” - Simon Willison

  • Quick Search: 15-30 seconds, 20-40 sources
  • Deep Research: 15-30 minutes, comprehensive report

o4-mini-high search response

Deep research question interface

Deep research comprehensive report

Specialized Research Tools

Elicit interface for systematic reviews

Elicit - Systematic Reviews

  • Extract specific attributes from papers
  • High-quality sources only
  • Cost: $$$ but streamlined

Other Tools

  • Your Agent + Semantic Scholar API
  • ResearchRabbit
  • Connected Papers

Key Benefit

Quality UI/UX, easy export to other formats

Image courtesy of Elicit

Agentic Coding: The Paradigm Shift

Evolution of AI Coding Tools

  1. Autocomplete (GitHub Copilot) → seconds of autonomy
  2. Pair Programming (Cursor) → minutes of autonomy
  3. Agentic Development (Claude Code) → hours of autonomy

And the result?

“Catnip for programmers”
- Armin Ronacher

Pairing with Zed AI agent in synchronous mode

Two Approaches: Synchronous vs. Asynchronous

Synchronous (Cursor)

  • Work in IDE
  • Seconds to minutes
  • Pair programming
  • Direct oversight

Asynchronous (Claude Code)

  • Work via issues/PRs
  • Minutes to hours
  • Project management
  • Multiple agents

Videos courtesy of Simon Willison and Kushagrasikka

The Future: Nerd Managers

Workflow Evolution

  1. Open GitHub issue
  2. Agent works autonomously
  3. Returns with PR
  4. Review and merge

Future Vision

“Developers will be empowered to keep work queues full in large fleets of coding agents” - Steve Yegge

Video courtesy of All Hands AI

What skills are needed for agentic development?

Graham Neubig from All Hands AI suggests these key skills:

  1. 🏗️🥧 Strong architecture skills and taste
  2. 💬 Communication
  3. 📖 Code reading
  4. 🔄 Multitasking

Taste and RLHF

Distribution of Average Quality and Empathy Ratings for Chatbot and Physician Responses to Patient Questions showing two panels: Panel A displays quality ratings distributions comparing chatbot vs physician responses, with chatbots showing higher ratings; Panel B shows empathy ratings distributions where physicians and chatbots have more similar distributions
Figure 1
Imitation (SFT) RLHF Why the difference matters
Model copies full human output distribution. That includes occasional mistakes and mediocre phrasing. Ceiling ≈ human average. Model samples its own answers and a human simply picks the better one. Humans don’t have to create perfection, only recognise it. Humans are far stronger critics than creators. Preference-grading lets them steer the model away from the left tail (bad answers) and pull the whole distribution rightward.
Training signal = “produce exactly what a human would have written.” Training signal = “move towards whichever candidate the human preferred.” Over many iterations the reward model keeps nudging the policy towards the best-judged answers, eventually surpassing the median human.

Image courtesy of Ayers et al. 2023

Developing Taste in Graduate School

Practice Deliberate Comparison

  • Generate multiple AI outputs for the same task and practice choosing which is better, articulating your reasoning
  • Develop familiarity with different frontier models and learn what each excels at
    • Claude
    • ChatGPT
    • Gemini
  • Platform selection itself is an exercise in taste

Engage in Structured Peer Collaboration

  • Seek out reviewing opportunities for conferences and journals to practice evaluating others’ work
  • Participate in code review with lab mates or open source projects to develop technical judgment
  • Don’t work in isolation - co-author papers with both senior and junior students, getting comfortable with giving and receiving feedback

Seek Active Mentorship

  • Ask experienced researchers to walk through their decision-making when they assess quality
  • Submit your own quality judgments to mentors for validation and refinement

No “Taste 101” course - learn by doing

The AI Execution Gap

Study details:

  • Prior studies found settings where LLM-generated research ideas were judged as more novel than human-expert ideas
  • This study: 43 researchers, 100+ hours each
  • Randomly assigned to execute AI-generated or human-expert ideas

Chart showing the AI execution gap - comparison of novelty, excitement, effectiveness and overall scores before and after execution for both AI-generated and human-expert research ideas, demonstrating that while AI ideas may seem more novel initially, they suffer larger drops after execution

Image courtesy of Si et al. 2025

Are Junior Developers in Trouble?

MYTH: Junior developers are doomed

REALITY: You’re best positioned to succeed

  • Quick to adopt AI-driven workflows
  • Treat AI tools as on-the-job training
  • Unburdened by legacy tooling

“Junior devs are vibing. They get it. The world is changing, and you have to adapt. So they adapt!”

“It’s not AI’s job to prove it’s better than you. It’s your job to get better using AI.” - Steve Yegge, “Revenge of the Junior Developer”

Two Simple Rules

Rule 1

“AI can be used to avoid learning, and AI can be used to assist learning”

Rule 2

“It’s ok to ask AI to do things you already know how to do,
but don’t ask AI to do things that you don’t know how to do”

Threading the Needle

Finding the right balance between learning with AI and automating with AI

Balance scale showing the sweet spot between learning and automation

Key Takeaways

  1. Unprecedented change - AI capabilities doubling every 7 months

  2. Avoiding AI accelerates displacement - Engage strategically instead

  3. Develop your taste - The ability to discriminate quality remains crucial

  4. Junior professionals have advantages - Adaptability + experience

  5. Use AI to assist learning, not avoid it - Build skills while automating

Thank You & Resources

Key Papers & Reports

Contact

www.gwtaylor.ca   Find me at posters!