Skip to content

Can't reproduce result. #39

@piaoyun2022

Description

@piaoyun2022

Hi, I'm recently try to reproduce your result. I use Deepseek-V3 as the base model. It seems that raw V3 have 41.89% EM on total CWQ test dataset, and Think-on-Graph somewhat gives 45%.
Could you give some advice about that? Thanks!

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions