Module 4: Core Agent Concepts: `LlmAgent` Deep Dive

Theory

The "Brain" of the Operation

At the heart of most agents you build with the ADK is the LlmAgent. This is the component that acts as the "thinking" part of your application. It uses the power of a Large Language Model (LLM) like Gemini to understand user requests, reason about them, and decide on a course of action.

Unlike a traditional program that follows a fixed, deterministic path, an LlmAgent is non-deterministic. It interprets the context of a conversation and its own instructions to dynamically figure out what to do next. This flexibility is what makes agents so powerful.

Building an effective LlmAgent requires a clear understanding of its core configuration parameters. In this module, we'll take a deep dive into the most important one: the instruction.

Defining the Agent's Identity

As you saw in the previous module, every LlmAgent has a few basic identity parameters, which can be set in agent.py or, for simpler agents, in a root_agent.yaml file:

name (Required): A unique identifier for the agent (e.g., echo_agent).
description (Optional): A short summary of the agent's purpose.
model (Required): The specific LLM that powers the agent (e.g., gemini-2.5-flash).

The Art of the Instruction

While the name and model are essential, the instruction parameter is where you truly shape your agent's behavior. The instruction is the master prompt that is sent to the LLM with every user request. It is your primary tool for controlling the agent.

A well-crafted instruction tells the agent:

Its Persona: How should it behave? Is it a formal assistant, a witty pirate, a helpful teacher?
- Example: `"You are a cheerful and enthusiastic assistant."*
Its Core Goal: What is its primary function?
- Example: `"Your main goal is to help users find information about movies."*
Its Constraints and Rules: What should it not do? Are there topics it should avoid?
- Example: `"You must never give financial advice. If asked, politely decline."*
- Production Readiness: Defining clear constraints is a critical practice for ensuring the safety and reliability of an agent in a production environment.
Its Process: If the task involves multiple steps, you can outline them.
- Example: `"First, ask the user for their location. Second, find the weather for that location. Third, report the weather to the user."*
Its Output Format: How should it format its responses?
- Example: `"Always present your final answer as a JSON object with a 'result' key."*

Tips for Effective Instructions (Prompt Engineering)

Crafting good instructions is a skill often called "prompt engineering." Here are some tips:

Be Clear and Specific: Ambiguity is the enemy. The more precise your instructions, the more reliable the agent's behavior will be.
Use Simple Language: Write instructions as if you were talking to a person. Avoid jargon.

Provide Examples (Few-Shot Prompting): One of the most powerful techniques is to include examples directly in the instruction.

Python Example (agent.py):

from google.adk.agents import LlmAgent

root_agent = LlmAgent(
    name="haiku_poet_agent",
    model="gemini-2.5-flash",
    instruction="""
      You are a wise and calm poet who only speaks in haikus (three lines with a 5, 7, 5 syllable structure).
      Your purpose is to take the user's message and transform the core topic into a haiku.
      Do not answer questions or have a conversation; only respond with a haiku inspired by the user's text.

      Example User Input: "I'm having trouble with my computer."
      Example Agent Output:
      Green light softly glows,
      The screen is dark, cold, and vast,
      Silence answers back.

      Example User Input: "The weather is so nice today!"
      Example Agent Output:
      Golden sun shines bright,
      A gentle breeze warms the skin,
      Summer day feels right.
    """
)

YAML Alternative (root_agent.yaml):

instruction: |
  You are a wise and calm poet who only speaks in haikus (three lines with a 5, 7, 5 syllable structure).
  Your purpose is to take the user's message and transform the core topic into a haiku.
  Do not answer questions or have a conversation; only respond with a haiku inspired by the user's text.

  Example User Input: "I'm having trouble with my computer."
  Example Agent Output:
  Green light softly glows,
  The screen is dark, cold, and vast,
  Silence answers back.

  Example User Input: "The weather is so nice today!"
  Example Agent Output:
  Golden sun shines bright,
  A gentle breeze warms the skin,
  Summer day feels right.

Iterate and Refine: Your first instruction will rarely be your last. Test your agent with different inputs and refine the instruction based on its responses.

In the lab for this module, you will practice this skill by modifying the "Echo" agent's instruction to give it a completely new personality and behavior.

Key Takeaways

The LlmAgent is the "brain" of an ADK application, using an LLM to reason and decide on actions.
The instruction parameter is the most powerful tool for controlling an agent's behavior, defining its persona, goals, constraints, and process.
Effective prompt engineering involves being specific, using simple language, providing examples (few-shot prompting), and iterating on your instructions.

Theory​

The "Brain" of the Operation​

Defining the Agent's Identity​

The Art of the Instruction​

Tips for Effective Instructions (Prompt Engineering)​

Key Takeaways​