feat(preloadmemorytool,tools): handle rich content types and add tool registry/discovery system by kshitizz36 · Pull Request #129 · google/adk-python

kshitizz36 · 2025-04-12T18:44:34Z

Summary

This PR enhances the PreloadMemoryTool class to support richer content types beyond plain text.

Changes

Added support for processing multiple content part types in memory events:
- Function calls (with function name and arguments)
- Function responses (with function name and response data)
- Inline data (with MIME type information)
- Text (existing functionality)
Implemented human-readable formatting for each content type
Ensured all content parts are properly joined when building memory context
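
As a rough illustration of the formatting described above, here is a minimal sketch. The `Part`, `FunctionCall`, `FunctionResponse`, and `InlineData` dataclasses are simplified stand-ins written for this example, not the real `google.genai` types, and the bracketed output format and the space separator are assumptions rather than what the PR emits.

```python
from __future__ import annotations

from dataclasses import dataclass
from typing import Any, Optional


# Simplified stand-ins for the content part types (assumptions for this sketch).
@dataclass
class FunctionCall:
    name: str
    args: dict[str, Any]


@dataclass
class FunctionResponse:
    name: str
    response: dict[str, Any]


@dataclass
class InlineData:
    mime_type: str
    data: bytes


@dataclass
class Part:
    text: Optional[str] = None
    function_call: Optional[FunctionCall] = None
    function_response: Optional[FunctionResponse] = None
    inline_data: Optional[InlineData] = None


def format_part(part: Part) -> Optional[str]:
    """Return a human-readable string for one part, or None if it is empty."""
    if part.text:
        return part.text
    if part.function_call:
        call = part.function_call
        return f"[Function call: {call.name}({call.args})]"
    if part.function_response:
        resp = part.function_response
        return f"[Function response from {resp.name}: {resp.response}]"
    if part.inline_data:
        return f"[Inline data of type {part.inline_data.mime_type}]"
    return None


def format_parts(parts: list[Part]) -> str:
    """Join the formatted parts, skipping parts that produce no text."""
    formatted = (format_part(p) for p in parts)
    return " ".join(s for s in formatted if s)
```

Only the MIME type of inline data is surfaced here; the raw bytes are deliberately left out of the memory text.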

Benefits

Improves LLM context by including data about past interactions with functions and non-textual content
Maintains readability of the preloaded memory for the LLM
Ensures comprehensive representation of past user-system interactions

kshitizz36 · 2025-04-15T01:31:25Z

@hangfei @boyangsvl PTAL.

kshitizz36 · 2025-04-15T04:33:19Z

@Jacksunwei PTAL

kshitizz36 · 2025-04-15T05:34:38Z

Add Tool Registry and Discovery System

Changes

Added tool_registry.py with a singleton ToolRegistry class for managing tool registrations
Updated base_tool.py to support automatic registration of tools
Added tool_discovery.py to demonstrate selecting appropriate tools based on context
Added support for categorizing tools for easier organization
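
To make the idea concrete, here is a minimal sketch of a singleton registry with automatic registration and categories. It is not the code in tool_registry.py or base_tool.py; the `register`, `get`, and `names_in_category` methods, the `category` attribute, and the use of `__init_subclass__` are all assumptions chosen for illustration.

```python
from __future__ import annotations

from collections import defaultdict


class ToolRegistry:
    """Process-wide registry; every ToolRegistry() call returns the same object."""

    _instance = None

    def __new__(cls):
        if cls._instance is None:
            cls._instance = super().__new__(cls)
            cls._instance._tools = {}
            cls._instance._by_category = defaultdict(set)
        return cls._instance

    def register(self, tool_cls, category="general"):
        """Record a tool class under its name and category."""
        self._tools[tool_cls.name] = tool_cls
        self._by_category[category].add(tool_cls.name)

    def get(self, name):
        return self._tools[name]

    def names_in_category(self, category):
        return sorted(self._by_category.get(category, ()))


class BaseTool:
    """Hypothetical base class: named subclasses register themselves on definition."""

    name = ""
    category = "general"

    def __init_subclass__(cls, **kwargs):
        super().__init_subclass__(**kwargs)
        if cls.name:  # skip abstract helpers that do not set a name
            ToolRegistry().register(cls, cls.category)


class CalendarTool(BaseTool):
    name = "calendar"
    category = "scheduling"


class CalculatorTool(BaseTool):
    name = "calculator"
    category = "math"
```

With this shape, importing a module that defines a tool is enough to make it discoverable, which is also the main trade-off: registration becomes an import side effect.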

Benefits

Better organization of the growing tool ecosystem
Runtime flexibility in tool selection based on user needs and context
Easier extension of the framework with new tools
More intelligent agent capabilities through context-aware tool selection

Testing

The changes have been tested with the existing tools to ensure backward compatibility.

kshitizz36 · 2025-04-15T05:46:35Z

Query

Would it be possible to create a new branch off the base branch main for the commit Add Tool Registry and Discovery System?
@hangfei

Jacksunwei · 2025-04-15T06:13:40Z

Thanks for the PR!

For preload_memory_tool, IIUC, you're implementing the TODO. However, the TODO is meant to cover multiple text parts, not other types of parts.

For tool_discovery and tool_registry, this is a big change to the API and we need to evaluate it further. Unlike LlmModel, we don't see a substantial gain from having a tool registry. Do you have a sample agent that benefits from this?

kshitizz36 · 2025-04-15T07:12:35Z

Hi @Jacksunwei, thanks for clarifying the requirement for handling text parts specifically.

Based on your feedback, I've pushed an update. The logic in preload_memory_tool now explicitly loops through event.content.parts, checks if part.text:, collects these text strings, and then joins them. This addresses the multi-part TODO while correctly focusing only on the textual content.
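
In rough terms, the loop looks like the sketch below. The `Part`, `Content`, and `Event` dataclasses are simplified stand-ins for this example, and the single-space separator is an assumption rather than a quote from the diff.

```python
from dataclasses import dataclass, field
from typing import List, Optional


# Minimal stand-ins for the event structure (assumptions for this sketch).
@dataclass
class Part:
    text: Optional[str] = None


@dataclass
class Content:
    parts: List[Part] = field(default_factory=list)


@dataclass
class Event:
    content: Optional[Content] = None


def event_text(event: Event) -> str:
    """Collect the text of every textual part and join it into one string."""
    if not event.content or not event.content.parts:
        return ""
    # Parts without text (e.g. function calls) are skipped.
    texts = [part.text for part in event.content.parts if part.text]
    return " ".join(texts)
```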

Let me know if this looks better!

kshitizz36 · 2025-04-15T08:23:17Z

Key Benefits with Concrete Examples

Dynamic Tool Discovery:
- Current approach: Agents need to explicitly import and instantiate specific tools.
- With registry: Agents can discover relevant tools at runtime based on task descriptions.
- Example: A chatbot agent can dynamically select calendar tools when the user mentions scheduling, or calculator tools when the user needs calculations.
Simplified Agent Implementation:
- Current approach: Agents maintain hardcoded lists of available tools.
- With registry: Agents can query the registry for appropriate tools.
- Example: An agent handling email tasks can query "email" category tools without knowing all tool implementations.
Extensibility:
- Current approach: Adding new tools requires modifying agent code.
- With registry: New tools can be added independently and discovered automatically.
- Example: Third-party developers can create plugins that register seamlessly with the system.
Testing and Mocking:
- Current approach: Mock tools require changing imports or dependency injection.
- With registry: Mock tools can be registered temporarily during tests.
- Example: Tests can register mock implementations that don't make actual API calls.

Sample Agent Implementation

I've included a sample TaskAssistantAgent implementation below that demonstrates the benefits of the tool registry. This agent selects appropriate tools based on user requests rather than having a fixed set of predefined tools.

Example: TaskAssistantAgent

# Copyright 2025 Google LLC
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
#     http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.

"""TaskAssistantAgent that dynamically selects tools based on user requests."""

from __future__ import annotations

from typing import List, Optional

from google.genai import types

from ..agents.base_agent import BaseAgent
from ..models.llm_request import LlmRequest
from ..tools.base_tool import BaseTool
from ..tools.tool_context import ToolContext
from ..tools.tool_discovery import ToolDiscovery


class TaskAssistantAgent(BaseAgent):
  """An agent that dynamically selects tools based on the user's task."""

  name: str = "task_assistant"
  description: str = "A helpful assistant that selects appropriate tools for your task."

  def __init__(
      self,
      name: str = "task_assistant",
      description: str = "A helpful assistant that selects appropriate tools for your task.",
      tool_categories: Optional[List[str]] = None,
      max_tools_per_task: int = 5,
  ):
    """Initialize the TaskAssistantAgent.
    
    Args:
      name: The name of the agent.
      description: The description of the agent.
      tool_categories: Optional list of tool categories to consider. If None, all
        registered tools are considered.
      max_tools_per_task: Maximum number of tools to select for a given task.
    """
    super().__init__(name=name, description=description)
    self._tool_categories = tool_categories
    self._max_tools_per_task = max_tools_per_task
    self._current_tools: List[BaseTool] = []

  async def process_user_request(
      self, user_message: str, tool_context: ToolContext
  ) -> LlmRequest:
    """Process a user request by selecting appropriate tools and preparing the LLM request.
    
    Args:
      user_message: The user's message.
      tool_context: The tool context.
      
    Returns:
      LLM request configured with appropriate tools.
    """
    # Select tools based on the user's message
    self._current_tools = ToolDiscovery.get_tools_for_task(
        task_description=user_message,
        categories=self._tool_categories,
        max_tools=self._max_tools_per_task,
    )
    
    # Create an LLM request with the selected tools
    llm_request = LlmRequest(
        config=types.GenerateContentConfig(
            temperature=0.2,
        ),
        contents=[
            types.Content(
                role="user",
                parts=[
                    types.Part(text=user_message),
                ],
            ),
        ],
        tools_dict={},
    )
    
    # Add the selected tools to the request
    for tool in self._current_tools:
      await tool.process_llm_request(
          tool_context=tool_context, llm_request=llm_request
      )
    
    # Log the tools that were selected
    tool_names = [tool.name for tool in self._current_tools]
    print(f"Selected tools for this task: {', '.join(tool_names)}")
    
    return llm_request
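
For context, the agent above relies on `ToolDiscovery.get_tools_for_task`. One simple way such a selection step could work is keyword overlap, sketched below. This is an assumption for illustration, not the logic in tool_discovery.py; the `ToolInfo` record, its `keywords` field, and the explicit `tools` argument are hypothetical.

```python
from __future__ import annotations

import re
from dataclasses import dataclass


@dataclass(frozen=True)
class ToolInfo:
    """Hypothetical metadata record describing one registered tool."""

    name: str
    category: str
    keywords: frozenset


def get_tools_for_task(task_description, tools, categories=None, max_tools=5):
    """Rank tools by how many of their keywords appear in the task text."""
    words = set(re.findall(r"[a-z0-9']+", task_description.lower()))
    scored = []
    for tool in tools:
        if categories is not None and tool.category not in categories:
            continue
        score = len(words & tool.keywords)
        if score > 0:
            scored.append((score, tool))
    # Highest score first; ties are broken by name for a stable order.
    scored.sort(key=lambda pair: (-pair[0], pair[1].name))
    return [tool for _, tool in scored[:max_tools]]


TOOLS = [
    ToolInfo("calendar", "scheduling", frozenset({"schedule", "meeting", "calendar"})),
    ToolInfo("calculator", "math", frozenset({"calculate", "root", "divided"})),
    ToolInfo("weather", "info", frozenset({"weather", "forecast"})),
]

selected = get_tools_for_task("Schedule a meeting with John tomorrow at 2pm", TOOLS)
```

A real implementation would likely need something stronger than keyword matching (for example, letting the LLM choose from tool descriptions), but the interface stays the same: task text in, a bounded list of tools out.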

kshitizz36 · 2025-04-15T08:24:00Z

Example of Using the TaskAssistantAgent

Here's a script that shows how to use the TaskAssistantAgent:

# Example script showing TaskAssistantAgent in action

import asyncio

from google.adk.agents.task_assistant_agent import TaskAssistantAgent
from google.adk.tools.calculator_tool import CalculatorTool
from google.adk.tools.weather_tool import WeatherTool
from google.adk.tools.calendar_tool import CalendarTool
from google.adk.tools.search_tool import SearchTool
from google.adk.tools.tool_context import ToolContext

# The tool imports above are assumed to be enough to register each tool
# through the auto-registration mechanism, so no explicit registration
# calls are needed here.

async def main():
    # Create the agent
    agent = TaskAssistantAgent(
        name="personal_assistant",
        description="A personal assistant that helps with various tasks",
        tool_categories=None,  # Consider all tool categories
        max_tools_per_task=3,  # Select up to 3 tools per task
    )
    
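    # Create a tool context. NOTE: in ADK, ToolContext is normally built by the
    # framework from an invocation context; the bare constructor call here is
    # a placeholder for illustration.
    tool_context = ToolContext()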
    
    # Example 1: Weather-related query
    print("\n=== Example 1: Weather Query ===")
    user_message = "What's the weather like in San Francisco today?"
    llm_request = await agent.process_user_request(user_message, tool_context)
    # In a real implementation, you would send this to the LLM and process the response
    
    # Example 2: Calendar-related query
    print("\n=== Example 2: Calendar Query ===")
    user_message = "Schedule a meeting with John tomorrow at 2pm"
    llm_request = await agent.process_user_request(user_message, tool_context)
    
    # Example 3: Math calculation
    print("\n=== Example 3: Math Calculation ===")
    user_message = "What is the square root of 144 divided by 2?"
    llm_request = await agent.process_user_request(user_message, tool_context)
    
    # Example 4: General knowledge query
    print("\n=== Example 4: Knowledge Query ===")
    user_message = "Who was the first person to walk on the moon?"
    llm_request = await agent.process_user_request(user_message, tool_context)

if __name__ == "__main__":
    asyncio.run(main())

kshitizz36 · 2025-04-23T18:26:21Z

@hangfei @Jacksunwei PTAL

kshitizz36 · 2025-04-24T02:56:35Z

@Jacksunwei PTAL

kshitizz36 · 2025-04-24T02:57:02Z

@Jacksunwei PTAL

boyangsvl · 2025-04-24T06:27:28Z

I think it's better to separate this into two PRs if it tackles two different problems: preload_memory and tool registry.

For tool registry, we are thinking of a unified Toolset interface which supports dynamic tool selection.

hangfei · 2025-05-30T21:45:19Z

ToolRegistry is already supported.

kshitizz36 added 7 commits April 12, 2025 14:59

Fixed typo in README.md : cooridnator -> coordinator

e337892

Merge branch 'google:main' into main

5e7dd5e

Merge branch 'google:main' into main

48add29

Enhance PreloadMemoryTool with Multi-Part Content Support

e2e33f5

Enhance PreloadMemoryTool to handle rich content types

99d3f13

Merge branch 'main' into main

e768fa5

Merge branch 'main' into main

fc3e9b6

Merge branch 'google:main' into main

0dd5ed0

kshitizz36 added 3 commits April 15, 2025 10:53

Added tool_registry.py

88df737

Updated base_tool.py to support automatic registration of tools

e778573

Added tool_discovery.py to demonstrate selecting appropriate tools based on context

d698e7c

Implement multi-part text handling in preload_memory_tool.py

6369197

kshitizz36 and others added 8 commits April 16, 2025 20:36

Update README.md

0d29158

Update README.md

07c78ec

Revert "Update README.md"

fd2ce2f

This reverts commit 07c78ec.

Revert "Update README.md"

2f39c87

This reverts commit 0d29158.

Merge branch 'google:main' into main

98f0c90

Merge branch 'google:main' into main

8a98fc0

Merge branch 'google:main' into main

4a77de5

Merge branch 'google:main' into main

55a922a

kshitizz36 changed the title ~~Enhance PreloadMemoryTool to handle rich content types~~ feat(preloadmemorytool,tools): handle rich content types and add tool registry/discovery system Apr 19, 2025

Merge branch 'main' into main

3255d07

kshitizz36 added 7 commits April 21, 2025 08:53

Merge branch 'main' into main

13a2b44

Merge branch 'google:main' into main

21ff1c5

Merge branch 'main' into main

d840f14

Merge branch 'main' into main

4b1c2f3

Merge branch 'main' into main

0f279b6

Merge branch 'main' into main

8950117

Merge branch 'main' into main

84e1a8a

Merge branch 'main' into main

8a8f7db

kshitizz36 mentioned this pull request Apr 24, 2025

feat(memory): improve multi-part text handling in PreloadMemoryTool #375

Closed

kshitizz36 marked this pull request as draft April 26, 2025 06:45

kshitizz36 closed this May 31, 2025
