The split is now real
Prompt engineering and LLMOps are often mixed together, but they solve different problems.
Prompt engineering is about shaping model behaviour for a task: instructions, examples, context, output format and iterative testing. LLMOps is about running model-powered systems reliably: deployment, routing, monitoring, evals, versioning, cost control, safety checks and incident response.
The durable career path is rarely "I write clever prompts all day". The market has moved toward workflow engineering. Prompt skill still matters, but it is most valuable when combined with product understanding, APIs, retrieval, evaluation and production software habits. This shows up in public discussions such as Reddit prompt-engineering career threads, current AI lab roles on Anthropic jobs, and engineering roles on OpenAI careers.
What prompt engineers actually do
In a serious product team, prompt work can include:
- Translating product behaviour into instructions.
- Designing examples and output schemas.
- Testing refusal, ambiguity and edge cases.
- Working with domain experts to define quality.
- Improving retrieval context.
- Documenting prompt versions and changes.
- Pairing with engineers on tool calls and UI behaviour.
Weak prompt work sounds like magic phrasing. Strong prompt work sounds like product specification plus testing.
Example:
export function buildExtractionPrompt(emailBody: string) {
return [
"Extract a support ticket from the email.",
"Return valid JSON with keys: customerName, issueSummary, urgency.",
"Urgency must be one of: low, medium, high.",
"If a field is missing, use null.",
"",
"Email:",
emailBody,
].join("\n");
}
The prompt is only useful if you validate the output:
import { z } from "zod";
export const TicketSchema = z.object({
customerName: z.string().nullable(),
issueSummary: z.string().nullable(),
urgency: z.enum(["low", "medium", "high"]).nullable(),
});
That pairing is the point: prompt plus contract.
What LLMOps engineers do
LLMOps owns the operational layer:
- Model provider routing and fallback.
- Prompt and model version management.
- Evaluation pipelines.
- Latency and cost monitoring.
- Retrieval pipeline health.
- Safety filters and policy gates.
- Incident response when quality regresses.
- Deployment and rollback.
This overlaps with MLOps and DevOps, but LLM applications have their own failure modes. A deployment can pass unit tests and still degrade answer quality. A prompt change can increase token cost. A retrieval change can silently remove key context. A provider outage can break a workflow unless fallback exists.
Job postings now use LLMOps explicitly. Examples in the research include Apple's ML evaluation role, Sumo Logic's MLOps/LLMOps engineer posting, and LLMOps roles listed by Inference Jobs.
Interview differences
Prompt-heavy interviews may ask:
- Improve this support-agent prompt.
- Design examples for a classification task.
- Handle ambiguous user intent.
- Create a JSON output format.
- Explain how you would test quality with domain experts.
LLMOps interviews may ask:
- Design an eval pipeline for a RAG assistant.
- Monitor hallucination rate, latency and cost.
- Roll back a bad prompt release.
- Route across model providers.
- Detect retrieval drift.
- Secure prompts and logs.
AI engineer interviews often include both. That is why candidates should not present themselves as only "good at prompts". A stronger positioning is:
I can design prompts, but I treat them as versioned product logic. I pair prompt changes with eval cases, output validation and monitoring so the team knows whether behaviour improved.
Which path should you choose?
Choose prompt/product AI work if you enjoy:
- User workflows.
- Language and task design.
- Domain expert collaboration.
- UX and product behaviour.
- Rapid experimentation.
Choose LLMOps if you enjoy:
- Reliability.
- Metrics and dashboards.
- Infrastructure.
- Testing and release gates.
- Cost and latency optimisation.
Backend engineers often transition well into LLMOps. Product engineers, technical writers, support engineers and domain specialists may transition well into prompt-heavy AI product work, especially if they can code enough to work with APIs and validation.
For both paths, learn:
- TypeScript or Python.
- API integration.
- Structured outputs.
- Retrieval basics.
- Evaluation design.
- Data privacy basics.
- Prompt injection risks.
The portfolio that proves the split
Build one project and show both sides:
- Prompt: task framing, schema, examples and edge cases.
- LLMOps: eval set, latency logging, cost tracking and rollback notes.
README outline:
## Behaviour goal
What the AI feature should do.
## Prompt design
Instructions, examples and output schema.
## Evaluation
Golden cases, failure cases and scoring.
## Operations
Latency budget, cost estimate, logging and rollback.
This is more credible than listing "prompt engineering" as a skill with no artefact. It shows you understand the role split.
Continue your prep
Compare both paths against live role prep: