Section 2AI Agents

Researcher Agent Deep Dive

Analysis of Microsoft's Researcher Agent, its limitations, and a complete custom agent architecture design for organizational knowledge retention.

Microsoft Researcher Agent: How It Works

The Microsoft Researcher Agent (available through Copilot) operates as an autonomous research orchestrator. When given a query, it:

  1. Decomposes the query into sub-questions
  2. Plans a research strategy across available sources
  3. Executes multiple searches (web, organizational data via Microsoft Graph)
  4. Synthesizes findings into a coherent response
  5. Cites sources for verification

Architecture Model

User Query → Query Decomposition → Search Planning → Parallel Execution
                                                          ↓
Final Report ← Synthesis ← Source Ranking ← Results Collection

Strengths

Strength Description Tax/Finance Relevance
Multi-source synthesis Combines web + organizational data Research across legislation, guidance, and internal policy
Autonomous planning Breaks complex queries into steps Handles multi-jurisdictional questions
Citation tracking Links to source material Audit trail for advisory positions
Iterative refinement Can be redirected with follow-ups Narrow down from broad research
Web freshness Accesses current online sources Latest regulatory changes

Weaknesses

Weakness Root Cause Impact on Tax Work
No persistent memory Session-based context only Cannot build cumulative knowledge of Canon's positions
Limited organizational depth Shallow SharePoint indexing Misses buried policy documents and historical memos
No learning from corrections Cannot incorporate feedback Repeats same errors across sessions
Generic reasoning Not domain-trained for tax Misapplies general legal reasoning to VAT-specific concepts
Source quality blindness Treats all web sources equally May cite outdated guidance or non-authoritative commentary
Context window constraints Fixed token limit Cannot process lengthy legislative texts in full

Why It Does Not Retain Knowledge Well

The Researcher Agent's memory limitation stems from three architectural constraints:

1. Stateless Session Design Each conversation starts with zero prior context. Unlike a human advisor who accumulates client knowledge over years, the agent has no mechanism to persist learning between sessions.

2. No Organizational Knowledge Graph The agent searches documents but does not maintain a structured understanding of relationships: which entities operate where, what positions have been taken previously, which advisors have given which opinions.

3. No Feedback Loop When you correct the agent ("No, we use the triangulation simplification for those supplies"), that correction exists only in the current session. The next time someone asks a similar question, the agent starts from scratch.

How to Improve Its Effectiveness

Prompting Strategies for Better Results

Strategy 1: Context Front-Loading Always begin researcher sessions with comprehensive context:

"Context for this research session: I am the VAT Manager at Canon Europe. We operate manufacturing, distribution, and services entities across 15 EU member states. Our ERP is Oracle R12. Key positions to be aware of: [list 3-5 critical current positions]. Standard applicable: EU VAT Directive 2006/112/EC and relevant implementing measures."

Strategy 2: Source Quality Instructions Direct the agent toward authoritative sources:

"When researching this question, prioritize in this order: (1) Primary legislation and EU directives, (2) ECJ/CJEU case law, (3) National tax authority guidance (official publications only), (4) Big 4/law firm publications from the last 12 months, (5) Academic commentary. Do NOT rely on: blog posts, forum discussions, or articles older than 3 years."

Strategy 3: Structured Output Requirements Force systematic analysis:

"Structure your research as: (A) Legal framework — applicable provisions with article references, (B) Case law — relevant ECJ decisions with case numbers, (C) Administrative guidance — applicable rulings or guidance notes, (D) Practical application — how this applies to our specific facts, (E) Risk assessment — confidence level and open questions."

Strategy 4: Iterative Deepening Use multiple rounds to build depth:

Round 1: "Give me a broad overview of [topic] across EU jurisdictions." Round 2: "Focus specifically on [Country A] and [Country B] — what are the key differences?" Round 3: "For the [Country A] position, what is the most recent case law?" Round 4: "Based on all of the above, draft a recommendation for our specific situation."


Custom Agent Design

Option A: Copilot Studio Agent

Architecture Overview:

A Copilot Studio agent configured with:

  • Custom Topics for tax-specific conversation flows
  • Knowledge Sources connected to SharePoint, Dataverse, and the IDF data lake
  • Custom Actions using Power Automate for structured workflows
  • Generative AI with system prompts tailored for tax analysis

Implementation Steps:

Phase Duration Activities
1. Foundation 2 weeks Create agent, configure knowledge sources, set system prompts
2. Knowledge Loading 3 weeks Index SharePoint libraries, structure Dataverse entities, connect data lake
3. Topic Design 2 weeks Build conversation topics for top 20 use cases
4. Testing 2 weeks User acceptance testing with real queries
5. Refinement Ongoing Monitor, tune, expand knowledge base

Knowledge Source Configuration:

Source Content Type Update Frequency Priority
SharePoint - Tax Policies Policy documents, procedures Monthly High
SharePoint - Advisory Archive Historical memos, analyses As created High
SharePoint - IDF Docs Project documentation, specs Weekly Medium
Dataverse - Decision Log Past decisions with rationale As captured Critical
Dataverse - Position Register Current VAT positions by jurisdiction As changed Critical
Data Lake - Invoice Data Transaction patterns, volumes Daily Medium
Outlook (Graph) Recent email threads on tax topics Real-time Low
Teams Transcripts Meeting decisions, action items Post-meeting Medium

System Prompt Design:

You are the Canon Tax & Finance AI Agent. Your role is to assist tax professionals 
with research, analysis, and documentation tasks.

CRITICAL RULES:
1. Always state when you are uncertain. Tax advice requires accuracy.
2. Reference specific legislation articles and case numbers.
3. When citing Canon internal positions, reference the source document and date.
4. Flag any areas where Canon's current position may differ from general guidance.
5. For questions involving multiple jurisdictions, address each separately.
6. Always include a confidence assessment: High (clear law, settled position), 
   Medium (some ambiguity, requires judgment), Low (unclear, recommend specialist review).
7. Do not provide advice on topics outside your knowledge base — suggest escalation.

YOUR KNOWLEDGE:
- Canon's VAT registration details by country
- Historical advisory memos and their conclusions
- Current VAT positions and the rationale behind them
- IDF project status and documentation
- Relevant EU and domestic tax legislation
- Recent case law and tax authority guidance

Option B: Custom AI Agent Architecture (Azure-Based)

For maximum control and knowledge retention, a custom architecture using Azure services:

Technology Stack:

Layer Technology Purpose
Interface Teams Bot / Web Chat User interaction
Orchestration Azure AI Agent Service Query routing and planning
LLM Azure OpenAI GPT-4 Reasoning and generation
Knowledge Azure AI Search Document retrieval (RAG)
Memory Azure Cosmos DB Persistent conversation and decision memory
Storage Azure Blob + SharePoint Document repository
Integration Microsoft Graph API Email, calendar, Teams data
Monitoring Azure Monitor + App Insights Usage tracking, quality metrics

Key Differentiator: Persistent Memory

Unlike the standard Researcher Agent, this architecture includes:

  1. Decision Memory — Every advisory conclusion is stored with its reasoning chain
  2. Correction Learning — User corrections are captured and applied to future responses
  3. Position Tracking — Current Canon positions are maintained as structured data
  4. Context Accumulation — Key facts about Canon's operations are persistently available

Implementation Estimate:

Component Effort Monthly Cost (Est.)
Azure OpenAI (GPT-4) Setup: 1 week €500-1,500
Azure AI Search Setup: 2 weeks €200-500
Cosmos DB Setup: 1 week €50-200
Bot Framework Development: 4 weeks €50 (hosting)
Graph API Integration Development: 2 weeks Included in M365
Document Processing Development: 2 weeks €100-300
Total ~12 weeks €900-2,500/month

Recommendation

Start with Option A (Copilot Studio) for these reasons:

  • Lower development cost and effort
  • Stays within Canon's existing Microsoft ecosystem
  • No additional Azure infrastructure to manage
  • Copilot Studio licensing likely available through existing E5/E3
  • Can be upgraded to Option B later if limitations emerge

Move to Option B when:

  • Knowledge retention requirements exceed Copilot Studio capabilities
  • Volume of queries justifies dedicated infrastructure
  • Need for custom reasoning chains or specialized tax logic
  • Integration with data lake requires more sophisticated processing