Ollama, Deepseek, and Graphs

Since the release of Deepseek, many people have been asking how good the model really is.

Despite all the benchmarks, all the memes, and the concerns about potential legal action, one question remains unanswered. This question is particularly relevant to us at Cognee, where we use LLMs to transform your data into agent-ready data layers.

Creating an LLM-Powered Knowledge Graph

One of the areas we explored was how capable these models are at creating LLM-powered knowledge graphs. Creating a simple knowledge graph is relatively straightforward: we define a Pydantic model, then use an LLM (via a function call) to generate a JSON output containing the nodes and edges needed to populate that Pydantic model.

Here’s an example of a model we can populate:

class Node(BaseModel):
    """Node in a knowledge graph."""

    id: str
    name: str
    type: str
    description: str
    properties: Optional[Dict[str, Any]] = Field(
        None, description="A dictionary of properties associated with the node."
    )

class Edge(BaseModel):
    """Edge in a knowledge graph."""

    source_node_id: str
    target_node_id: str
    relationship_name: str
    properties: Optional[Dict[str, Any]] = Field(
        None, description="A dictionary of properties associated with the edge."
    )

class KnowledgeGraph(BaseModel):
    """Knowledge graph."""

    nodes: List[Node] = Field(..., default_factory=list)
    edges: List[Edge] = Field(..., default_factory=list)

Then we send some data, for example:


job_position = """
Senior Data Scientist (Machine Learning)

Company: TechNova Solutions
Location: San Francisco, CA

Job Description:

TechNova Solutions is seeking a Senior Data Scientist specializing in Machine Learning ...
(etc.)
Candidate CVs
"""

job_1 = """
CV 1: Relevant
Name: Dr. Emily Carter
Contact Information:
(etc.)
"""

job_2 = """
CV 2: Relevant
Name: Michael Rodriguez
Contact Information:
(etc.)
"""

job_3 = """
CV 3: Relevant
Name: Sarah Nguyen
Contact Information:
(etc.)
"""
...

Below is the corresponding query we would send to the LLM to add the data to the graph:

async def extract_content_graph(content: str, response_model: Type[BaseModel]):
    llm_client = get_llm_client()

    system_prompt = render_prompt("generate_graph_prompt.txt", {})
    content_graph = await llm_client.acreate_structured_output(
        content, system_prompt, response_model
    )

    return content_graph

When using OpenAI to generate a graph, we get a result similar to this:

openai

Comparing Deepseek and Other Models

We used Ollama, a free, open-source tool that lets you run large language models (LLMs) locally on your computer, to run Deepseek. However, when using Ollama with Deepseek models, we found they often struggle to generate even basic structured output. They sometimes fail to run locally and occasionally return answers in Chinese. We tried running the following models:

Deepseek-r1:1.5b
Deepseek-r1:7b
Deepseek-r1:8b

To compare Deepseek with another small model, we used the latest Mistral 7B. This model performed better on a simple structured example and could return structured output. However, with Cognee, it still failed to generate our knowledge graph structure (nodes and edges). Unsurprisingly, it ended with an error during search because it didn’t extract any data, leaving the collections empty. It also sometimes fails to produce a basic string response in a structured format.

Meanwhile, the Llama 3.1 7B model is more usable and can occasionally generate the graph as well as provide answers from retrieved context.

Running the Deepseek-r1:32b model produced some surprising results. The larger model can actually create simpler graphs and, after several retries, it gives reasonable outputs. You can see the graph visualization below, which doesn’t differ much from what OpenAI generates:

deepseek_r1

Where Deepseek Stands: Key Takeaways for LLM-powered Knowledge Graphs

Our conclusion is that smaller models still aren’t quite there. While larger Deepseek models can produce impressive results, they aren’t yet reliable enough for our team’s needs. However, as model sizes increase—and as we move away from distilled models—the Ollama–Deepseek combination shows promise, and other models also perform better at generating graphs.

In the follow up blog, we will run some evaluations an show the difference in answers quality between different models.

If you’d like to try it yourself, please refer to our documentation.

You can find more information about Ollama here.

You can also join the Ollama Discord to learn more.

Finally, cognee has a vibrant Discord community—join us to get your questions answered, learn, and share your insights!

Join the cognee community to hear about new releases, use cases, and all the things we're working on.

From the blog

Deep Dives

AI Memory Meets Real-World Testing: Rethinking Traditional QA BenchmarksSee how cognee performs against LightRAG, Graphiti (Zep), and Mem0 in AI memory benchmarks. Explore detailed comparisons, evaluation metrics, and try yourself!

Enhancing Knowledge Graphs with Ontology IntegrationDiscover how integrating formal ontologies with knowledge graphs dramatically improves information retrieval, semantic understanding, and query capabilities.

cognee & LlamaIndex: Building Powerful GraphRAG PipelinesLearn to build GraphRAG pipelines with cognee and LlamaIndex, handling structured and unstructured data in LLM workflows for improved accuracy. Try it now!

7 mins read

Hande Kafkas

Jan 10, 2025