Tuesday, February 3, 2026
HomeTechnologyThe 5 Abilities I Really Use Each Day as an AI PM...

The 5 Abilities I Really Use Each Day as an AI PM (and How You Can Too) – O’Reilly

This put up first appeared on Aman Khan’s AI Product Playbook e-newsletter and is being republished right here with the writer’s permission.

Let me begin with some honesty. When individuals ask me “Ought to I turn out to be an AI PM?” I inform them they’re asking the flawed query.

Right here’s what I’ve realized: Changing into an AI PM isn’t about chasing a stylish job title. It’s about growing concrete expertise that make you simpler at constructing merchandise in a world the place AI touches all the pieces.

Each PM is changing into an AI PM, whether or not they understand it or not. Your cost movement may have fraud detection. Your search bar may have semantic understanding. Your buyer help may have chatbots.

Consider AI product administration as much less of an OR and as an alternative extra of an AND. For instance: AI x well being tech PM or AI x fintech PM.

The 5 Abilities I Really Use Each Day

This put up was tailored from a dialog with Aakash Gupta on The Progress Podcast. You’ll find the episode right here.

After ~9 years of constructing AI merchandise (the final three of which have been a whole ramp-up utilizing LLMs and brokers), listed below are the abilities I take advantage of always—not those that sound good in a weblog put up however the ones I actually used yesterday.

  • AI prototyping
  • Observability, akin to telemetry
  • AI evals: The brand new PRD for AI PMs
  • RAG versus fine-tuning versus immediate engineering
  • Working with AI engineers

1. Prototyping: Why I code each week

Final month, our design workforce spent two weeks creating stunning mocks for an AI agent interface. It regarded good. Then I spent half-hour in Cursor constructing a useful prototype, and we instantly found three elementary UX issues the mocks hadn’t revealed.

The ability: Utilizing AI-powered coding instruments to construct tough prototypes.
The software: Cursor. (It’s VS Code however you possibly can describe what you need in plain English.)
Why it issues: AI habits is unimaginable to know from static mocks.

How you can begin this week:

  1. Obtain Cursor.
  2. Construct one thing stupidly easy. (I began with a private web site touchdown web page.)
  3. Present it to an engineer and ask what you probably did flawed.
  4. Repeat.

You’re not making an attempt to turn out to be an engineer. You’re making an attempt to know constraints and prospects.

2. Observability: Debugging the black field

Observability is the way you truly peek beneath the hood and see how your agent is working.

The ability: Utilizing traces to know what your AI truly did.
The software: Any APM that helps LLM tracing. (We use our personal at Arize, however there are a lot of.)
Why it issues: “The AI is damaged” isn’t actionable. “The context retrieval returned the flawed doc” is.

Your first observability train:

  1. Choose any AI product you utilize each day.
  2. Attempt to set off an edge case or error.
  3. Write down what you assume went flawed internally.
  4. This psychological mannequin constructing is 80% of the ability.

3. Evaluations: Your new definition of “executed”

Vibe coding works if you happen to’re transport prototypes. It doesn’t actually work if you happen to’re transport manufacturing code.

The ability: Turning subjective high quality into measurable metrics.
The software: Begin with spreadsheets, graduate to correct eval frameworks.
Why it issues: You may’t enhance what you possibly can’t measure.

Construct your first eval:

  1. Choose one high quality dimension (conciseness, friendliness, accuracy).
  2. Create 20 examples of excellent and dangerous. Label them “verbose” or “concise.”
  3. Rating your present system. Set a goal: 85% of responses needs to be “excellent.”
  4. That quantity is now your new North Star. Iterate till you hit it.

4. Technical instinct: Figuring out your choices

Immediate engineering (1 day): Add model voice pointers to the system immediate.

Few-shot examples (3 days): Embrace examples of on-brand responses.

RAG with model information (1 week): Pull from our precise model documentation.

High-quality-tuning (1 month): Prepare a mannequin on our help transcripts.

Every has completely different prices, timelines, and trade-offs. My job is figuring out which to advocate.

Constructing instinct with out constructing fashions:

  1. If you see an AI characteristic you want, write down 3 ways they could have constructed it.
  2. Ask an AI engineer if you happen to’re proper.
  3. Fallacious guesses educate you greater than proper ones.

5. The brand new PM-engineer partnership

The most important shift? How I work with engineers.

Previous means: I write necessities. They construct it. We take a look at it. Ship.

New means: We label coaching knowledge collectively. We outline success metrics collectively. We debug failures collectively. We personal outcomes collectively.

Final month, I spent two hours with an engineer labeling whether or not responses had been “useful” or not. We disagreed on a variety of them. This taught me that I would like to start out collaborating on evals with my AI engineers.

Begin collaborating otherwise:

  • Subsequent characteristic: Ask to hitch a mannequin analysis session.
  • Provide to assist label take a look at knowledge.
  • Share buyer suggestions by way of eval metrics.
  • Have fun eval enhancements such as you used to have a good time characteristic launches.

Your 4-Week Transition Plan

Week 1: Instrument setup

  • Set up Cursor.
  • Get entry to your organization’s LLM playground.
  • Discover the place your AI logs/traces reside.
  • Construct one tiny prototype (took me three hours to construct my first).

Week 2: Remark

  • Hint 5 AI interactions in merchandise you utilize.
  • Doc what you assume occurred versus what truly occurred.
  • Share findings with an AI engineer for suggestions.

Week 3: Measurement

  • Create your first 20-example eval set.
  • Rating an present characteristic.
  • Suggest one enchancment primarily based on the scores.

Week 4: Collaboration

  • Be a part of an engineering mannequin assessment.
  • Volunteer to label 50 examples.
  • Body your subsequent characteristic request as eval standards.

Week 5: Iteration

  • Take your learnings from prototyping and construct them right into a manufacturing proposal.
  • Set the bar with evals.
  • Use your AI Instinct for iteration—Which knobs do you have to flip?

The Uncomfortable Fact

Right here’s what I want somebody had advised me three years in the past: You’ll really feel like a newbie once more. After years of being the professional within the room, you’ll be the individual asking primary questions. That’s precisely the place it is advisable to be.

The PMs who achieve AI are those who’re snug being uncomfortable. They’re those who construct dangerous prototypes, ask “dumb” questions, and deal with each complicated mannequin output as a studying alternative.

Begin this week

Don’t await the proper course, the perfect position, or for AI to “stabilize.” The talents you want are sensible, learnable, and instantly relevant.

Choose one factor from this put up, decide to doing it this week, after which inform somebody what you realized. That is the way you’ll start to speed up your individual suggestions loop for AI product administration.

The hole between PMs who discuss AI and PMs who construct with AI is smaller than you assume. It’s measured in hours of hands-on follow, not years of examine.

See you on the opposite facet.

RELATED ARTICLES

LEAVE A REPLY

Please enter your comment!
Please enter your name here

Most Popular

Recent Comments