Agentic Workflows with DeepSeek in LlamaIndex

2025-02-06 AICamp DeepSeek+

What are we talking about?

  • What is LlamaIndex?
  • What is an Agent?
  • Agent Patterns in LlamaIndex
    • Chaining
    • Routing
    • Parallelization
    • Orchestrator-Workers
    • Evaluator-Optimizer
  • Giving Agents access to data
  • How RAG works
  • Using DeepSeek with LlamaIndex

What is LlamaIndex?

Python: docs.llamaindex.ai

TypeScript: ts.llamaindex.ai

LlamaParse

World's best parser of complex documents

Free for 1000 pages/day!

cloud.llamaindex.ai

LlamaCloud

Turn-key RAG API for Enterprises

Available as SaaS or private cloud deployment

1. Sign up:

cloud.llamaindex.ai

2. Get on the waitlist!

bit.ly/llamacloud-waitlist

LlamaHub

llamahub.ai

Why LlamaIndex?

  • Build faster
  • Skip the boilerplate
  • Avoid early pitfalls
  • Get best practices for free
  • Go from prototype to production

What is an agent?

A new paradigm for programming

Building Effective Agents (Anthropic's taxonomy of agent patterns)

  • Chaining
  • Routing
  • Parallelization
  • Orchestrator-Workers
  • Evaluator-Optimizer

Chaining

Workflows

from llama_index.core.workflow import Event, StartEvent, StopEvent, Workflow, step


class FirstEvent(Event):
    first_output: str


class SecondEvent(Event):
    second_output: str


class MyWorkflow(Workflow):
    @step
    async def step_one(self, ev: StartEvent) -> FirstEvent:
        print(ev.first_input)
        return FirstEvent(first_output="First step complete.")

    @step
    async def step_two(self, ev: FirstEvent) -> SecondEvent:
        print(ev.first_output)
        return SecondEvent(second_output="Second step complete.")

    @step
    async def step_three(self, ev: SecondEvent) -> StopEvent:
        print(ev.second_output)
        return StopEvent(result="Workflow complete.")
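Running it is a single async call; a minimal usage sketch (first_input becomes an attribute on the StartEvent above):

w = MyWorkflow(timeout=10, verbose=False)
result = await w.run(first_input="Start the workflow.")
print(result)  # "Workflow complete."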

Workflow visualization
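The visualization shown here can be generated with the draw_all_possible_flows utility (it requires the llama-index-utils-workflow package):

from llama_index.utils.workflow import draw_all_possible_flows

# Writes an interactive HTML diagram of every possible path through the workflow.
draw_all_possible_flows(MyWorkflow, filename="my_workflow.html")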

Routing

Branching
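In Workflows, routing (branching) is a step whose return annotation is a union of event types: the step decides at runtime which event to emit, and only the matching downstream step runs. A minimal sketch; the event names and the coin flip stand in for a real LLM routing decision:

import random

from llama_index.core.workflow import Event, StartEvent, StopEvent, Workflow, step


class BranchA(Event):
    payload: str


class BranchB(Event):
    payload: str


class RoutingWorkflow(Workflow):
    @step
    async def route(self, ev: StartEvent) -> BranchA | BranchB:
        # In a real workflow, an LLM call would classify the input here.
        if random.random() < 0.5:
            return BranchA(payload="Took branch A")
        return BranchB(payload="Took branch B")

    @step
    async def handle_a(self, ev: BranchA) -> StopEvent:
        return StopEvent(result=ev.payload)

    @step
    async def handle_b(self, ev: BranchB) -> StopEvent:
        return StopEvent(result=ev.payload)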

Parallelization

Flavor 1: Sectioning

Flavor 2: Voting

Concurrency
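Concurrency is the workflow-native way to parallelize: one step emits several events via ctx.send_event, a worker step handles them in parallel (num_workers), and a collector step waits for every result with ctx.collect_events. A minimal sketch with illustrative event names:

from llama_index.core.workflow import Context, Event, StartEvent, StopEvent, Workflow, step


class QueryEvent(Event):
    query: str


class ResultEvent(Event):
    result: str


class ParallelWorkflow(Workflow):
    @step
    async def fan_out(self, ctx: Context, ev: StartEvent) -> QueryEvent:
        # Emits events instead of returning one.
        ctx.send_event(QueryEvent(query="Query 1"))
        ctx.send_event(QueryEvent(query="Query 2"))
        ctx.send_event(QueryEvent(query="Query 3"))

    @step(num_workers=3)
    async def work(self, ctx: Context, ev: QueryEvent) -> ResultEvent:
        # Each worker runs concurrently; an LLM or retrieval call would go here.
        return ResultEvent(result=f"Answered: {ev.query}")

    @step
    async def collect(self, ctx: Context, ev: ResultEvent) -> StopEvent:
        # collect_events returns None until all three ResultEvents have arrived.
        results = ctx.collect_events(ev, [ResultEvent] * 3)
        if results is None:
            return None
        return StopEvent(result=[r.result for r in results])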

Orchestrator-Workers

Concurrency (again)

Orchestration in LlamaIndex
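Orchestrator-workers uses the same fan-out/collect machinery, except the number of subtasks is decided at runtime and stashed in the Context. A minimal sketch; the hard-coded task list stands in for an LLM planning step, and the "num_tasks" key name is illustrative:

from llama_index.core.workflow import Context, Event, StartEvent, StopEvent, Workflow, step


class TaskEvent(Event):
    task: str


class TaskResult(Event):
    result: str


class Orchestrator(Workflow):
    @step
    async def plan(self, ctx: Context, ev: StartEvent) -> TaskEvent:
        tasks = ["subtask 1", "subtask 2", "subtask 3"]  # normally produced by an LLM
        await ctx.set("num_tasks", len(tasks))
        for task in tasks:
            ctx.send_event(TaskEvent(task=task))

    @step(num_workers=4)
    async def work(self, ctx: Context, ev: TaskEvent) -> TaskResult:
        # Each worker would call an LLM or a tool here.
        return TaskResult(result=f"Finished {ev.task}")

    @step
    async def merge(self, ctx: Context, ev: TaskResult) -> StopEvent:
        num_tasks = await ctx.get("num_tasks")
        results = ctx.collect_events(ev, [TaskResult] * num_tasks)
        if results is None:
            return None
        return StopEvent(result=[r.result for r in results])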

Evaluator-Optimizer

aka Self-reflection

Looping
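Loops fall out of the same mechanism: an evaluator step either emits a StopEvent or sends the work back to an earlier step. A minimal self-reflection sketch; the attempt counter and quality check are illustrative stand-ins for a real LLM-as-judge call:

from llama_index.core.workflow import Event, StartEvent, StopEvent, Workflow, step


class DraftEvent(Event):
    draft: str
    attempts: int


class FeedbackEvent(Event):
    feedback: str
    attempts: int


class ReflectionWorkflow(Workflow):
    @step
    async def generate(self, ev: StartEvent | FeedbackEvent) -> DraftEvent:
        attempts = getattr(ev, "attempts", 0) + 1
        # An LLM would draft (or revise, using ev.feedback) here.
        return DraftEvent(draft=f"Draft #{attempts}", attempts=attempts)

    @step
    async def evaluate(self, ev: DraftEvent) -> FeedbackEvent | StopEvent:
        # An LLM judging the draft would go here; cap retries to avoid infinite loops.
        if ev.attempts < 3:
            return FeedbackEvent(feedback="Not good enough, try again.", attempts=ev.attempts)
        return StopEvent(result=ev.draft)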

Arbitrary complexity

Tool use

from llama_index.core.tools import FunctionTool


def multiply(a: int, b: int) -> int:
    """Multiply two integers and return the result."""
    return a * b


multiply_tool = FunctionTool.from_defaults(fn=multiply)

FunctionCallingAgent

from llama_index.core.agent import FunctionCallingAgent

agent = FunctionCallingAgent.from_tools(
    [multiply_tool],
    llm=llm,
)
response = agent.chat("What is 3 times 4?")
print(response)

Multi-agent systems

AgentWorkflow

from llama_index.core.agent.workflow import FunctionAgent

# search_web and record_notes are tools defined elsewhere in the tutorial.
research_agent = FunctionAgent(
    name="ResearchAgent",
    description="Useful for searching the web for information on a given topic and recording notes on the topic.",
    system_prompt=(
        "You are the ResearchAgent that can search the web for information on a given topic and record notes on the topic. "
        "Once notes are recorded and you are satisfied, you should hand off control to the WriteAgent to write a report on the topic."
    ),
    llm=llm,
    tools=[search_web, record_notes],
    can_handoff_to=["WriteAgent"],
)

Multi-agent system as a one-liner

from llama_index.core.agent.workflow import AgentWorkflow

agent_workflow = AgentWorkflow(
    agents=[research_agent, write_agent, review_agent],
    root_agent=research_agent.name,
    initial_state={
        "research_notes": {},
        "report_content": "Not written yet.",
        "review": "Review required.",
    },
)
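Running the whole system is then a single async call; a minimal usage sketch (the prompt is illustrative):

response = await agent_workflow.run(
    user_msg="Write me a report on the history of the web."
)
print(response)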

Full AgentWorkflow and Workflows tutorial

Giving agents access to data

RAG provides infinite context

How RAG works

Basic RAG pipeline:

  • Loading
  • Parsing
  • Embedding
  • Storing
  • Retrieving
  • Querying

Multi-modal

RAG in 5 lines

from llama_index.core import VectorStoreIndex
from llama_parse import LlamaParse

# LlamaParse reads LLAMA_CLOUD_API_KEY from the environment.
documents = LlamaParse().load_data("./myfile.pdf")
index = VectorStoreIndex.from_documents(documents)
query_engine = index.as_query_engine()
response = query_engine.query("What did the author do growing up?")
print(response)

RAG limitations

  • Summarization: solve it with routing, parallelization
  • Comparison: solve it with parallelization
  • Multi-part questions: solve it with chaining, parallelization
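As one concrete example of the routing fix for summarization, LlamaIndex ships a RouterQueryEngine that picks between query engines per question. A sketch, assuming documents has already been loaded as above; the tool descriptions are illustrative:

from llama_index.core import SummaryIndex, VectorStoreIndex
from llama_index.core.query_engine import RouterQueryEngine
from llama_index.core.tools import QueryEngineTool

summary_tool = QueryEngineTool.from_defaults(
    query_engine=SummaryIndex.from_documents(documents).as_query_engine(),
    description="Useful for summarizing the entire document.",
)
vector_tool = QueryEngineTool.from_defaults(
    query_engine=VectorStoreIndex.from_documents(documents).as_query_engine(),
    description="Useful for answering specific questions about the document.",
)

# The router uses an LLM to pick the right tool for each incoming question.
query_engine = RouterQueryEngine.from_defaults(
    query_engine_tools=[summary_tool, vector_tool],
)
response = query_engine.query("Summarize this document.")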

Recap

Using DeepSeek

Step 1:

pip install llama-index-llms-deepseek

Step 2:

from llama_index.llms.deepseek import DeepSeek
llm = DeepSeek(model="deepseek-chat", api_key="your_api_key")
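To make DeepSeek the default model for every index, agent, and workflow above, it can also be set globally; a minimal sketch:

from llama_index.core import Settings

Settings.llm = llm

# Quick sanity check that the model responds:
print(llm.complete("Say hello in one sentence."))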

Using local embeddings

from llama_index.embeddings.huggingface import HuggingFaceEmbedding

embed_model = HuggingFaceEmbedding(model_name="./bge-small-en-v1.5-model")
index = VectorStoreIndex.from_documents(documents, embed_model=embed_model)

Other places to run DeepSeek

  • Azure AI
  • Google Vertex AI
  • AWS Bedrock
  • IBM WatsonX
  • Ollama
  • Fireworks AI
  • Databricks
  • Groq

Example (using Groq):

from llama_index.llms.groq import Groq

llm = Groq(
    model="deepseek-r1-distill-llama-70b",
    api_key="your_api_key",
)

DeepSeek is great!

(And so easy to use!)

What's next?

Thanks!

Follow me on BlueSky:

@seldo.com
