Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[BFCL] Wether there is any semantic checker for the parameters' value? #643

Open
Ghevil opened this issue Sep 18, 2024 · 1 comment
Open
Labels
BFCL-Dataset BFCL Dataset-Related Issue BFCL-General General BFCL Issue

Comments

@Ghevil
Copy link

Ghevil commented Sep 18, 2024

Describe the issue
When we do the evaluation, do we have some checker that checks the semantic consistency of the parameters instead of simply matching them from the candidate list?

It seems too harsh that some minor errors that lead to errors, such as San Diego and San Diego, CA are different in format, but the two are semantically consistent.

What's more, like this error "Invalid value for parameter 'items': ['pumpkin', 'egg']. Expected one of [['pumpkins', 'eggs'], ['pumpkin', 'dozen eggs']]."

Looking forward to your reply.

@HuanzhiMao
Copy link
Collaborator

Hi @Ghevil,

Unfortunately, we don't have any semantic checkers at this moment.

In addition to trying to make the candidate list as comprehensive as possible, we have also tried to be clear and pose restrictions on the parameter formats in the function documentation. For example, in the location example you provided, we would phrase it as the following so that only San Diego, CA is correct and San Diego would be wrong.

{
    "location": {
        "type": "string",
        "description": "The location in 'city, state' format."
    }
}

As another example, for things like date, there would only be one correct ground truth.

{
    "end_date": {
        "type": "string",
        "description": "The ending date until which to retrieve stock prices. Format: 'yyyy-mm-dd'."
    }
}

If you have any other good solutions, we would love to hear your thoughts.

@HuanzhiMao HuanzhiMao added BFCL-General General BFCL Issue BFCL-Dataset BFCL Dataset-Related Issue labels Sep 23, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
BFCL-Dataset BFCL Dataset-Related Issue BFCL-General General BFCL Issue
Projects
None yet
Development

No branches or pull requests

2 participants