Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[BFCL-v2] Dataset and Possible Answer Fix #661

Merged
merged 15 commits into from
Oct 16, 2024
Merged
Show file tree
Hide file tree
Changes from 10 commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
4 changes: 4 additions & 0 deletions berkeley-function-call-leaderboard/CHANGELOG.md
Original file line number Diff line number Diff line change
Expand Up @@ -22,6 +22,10 @@ All notable changes to the Berkeley Function Calling Leaderboard will be documen
- `microsoft/Phi-3-small-8k-instruct`
- `microsoft/Phi-3-mini-128k-instruct`
- `microsoft/Phi-3-mini-4k-instruct`
- [Sept 27, 2024] [#661](https://github.com/ShishirPatil/gorilla/pull/661): Bug fix in the dataset and possible answers.
HuanzhiMao marked this conversation as resolved.
Show resolved Hide resolved
- Live Simple: 5 affected
- Live Multiple: 1 affected
- Parallel Multiple: 2 affected
- [Sept 25, 2024] [#660](https://github.com/ShishirPatil/gorilla/pull/660): Bug fix in `parse_nested_value` function to handle nested dictionary values properly.
- [Sept 24, 2024] [#657](https://github.com/ShishirPatil/gorilla/pull/657): Add the following new models to the leaderboard:
- `meta-llama/Llama-3.2-1B-Instruct`
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -194,7 +194,7 @@
{"id": "irrelevance_193", "question": [[{"role": "user", "content": "Who is the top scorer for Los Angeles Lakers?"}]], "function": [{"name": "get_sport_team_details", "description": "Retrieve information about a sports team including roster, previous results, upcoming matches, etc.", "parameters": {"type": "dict", "properties": {"team_name": {"type": "string", "description": "The name of the team."}, "details": {"type": "array", "items": {"type": "string", "enum": ["roster", "results", "upcoming_matches"]}, "description": "Specific details about the team you want to retrieve."}}, "required": ["team_name", "details"]}}]}
{"id": "irrelevance_194", "question": [[{"role": "user", "content": "What is the best chess move for white player in this position?"}]], "function": [{"name": "fetch_game_stats", "description": "Fetch board game statistics like top players, winning scores and game histories", "parameters": {"type": "dict", "properties": {"game_type": {"type": "string", "description": "The type of the board game."}, "year": {"type": "integer", "description": "The year when the game was played."}, "location": {"type": "string", "description": "The location where the game was played. This is an optional parameter.", "default": "NY"}}, "required": ["game_type", "year"]}}]}
{"id": "irrelevance_195", "question": [[{"role": "user", "content": "Who won the chess tournament in 2015?"}]], "function": [{"name": "game.board_analyser", "description": "Analyse a given board position of the game and suggest the optimal next move", "parameters": {"type": "dict", "properties": {"game": {"type": "string", "description": "The name of the game. In this case, chess"}, "player": {"type": "string", "description": "The current player whose turn is to move."}, "position": {"type": "string", "description": "The current state of the board in FEN (Forsyth\u2013Edwards Notation) format."}, "difficulty": {"type": "string", "default": "medium", "description": "The level of difficulty for the suggested move. Options include 'easy', 'medium', 'hard'."}}, "required": ["game", "player", "position"]}}]}
{"id": "irrelevance_196", "question": [[{"role": "user", "content": "What's the total number of possible arrangements in a chess game?"}]], "function": [{"name": "boardgame.calculate_score", "description": "Calculate final scores for a board game given a list of player actions.", "parameters": {"type": "dict", "properties": {"player_actions": {"type": "array", "items": {"type": "dict", "properties": {"player_id": {"type": "integer", "description": "Unique identifier for each player."}, "action": {"type": "string", "description": "Action performed by the player. Possible values are: 'buy property', 'sell property', 'pass go', 'pay fine'."}, "property_id": {"type": "integer", "description": "Unique identifier for each property in the game."}}, "required": ["player_id", "action"]}, "description": "A list of player actions."}, "initial_scores": {"type": "dict", "properties": {"player_id": {"type": "integer", "description": "Unique identifier for each player."}, "score": {"type": "integer", "description": "Initial score of the player. Defaults to 0 if not provided."}}, "description": "Initial scores for each player.", "requried": ["player_id", "score"]}}, "required": ["player_actions"]}}]}
{"id": "irrelevance_196", "question": [[{"role": "user", "content": "What's the total number of possible arrangements in a chess game?"}]], "function": [{"name": "boardgame.calculate_score", "description": "Calculate final scores for a board game given a list of player actions.", "parameters": {"type": "dict", "properties": {"player_actions": {"type": "array", "items": {"type": "dict", "properties": {"player_id": {"type": "integer", "description": "Unique identifier for each player."}, "action": {"type": "string", "description": "Action performed by the player. Possible values are: 'buy property', 'sell property', 'pass go', 'pay fine'."}, "property_id": {"type": "integer", "description": "Unique identifier for each property in the game."}}, "required": ["player_id", "action"]}, "description": "A list of player actions."}, "initial_scores": {"type": "dict", "properties": {"player_id": {"type": "integer", "description": "Unique identifier for each player."}, "score": {"type": "integer", "description": "Initial score of the player. Defaults to 0 if not provided."}}, "description": "Initial scores for each player.", "required": ["player_id", "score"]}}, "required": ["player_actions"]}}]}
{"id": "irrelevance_197", "question": [[{"role": "user", "content": "Who won the game of Monopoly last night?"}]], "function": [{"name": "board_game.possible_moves", "description": "Calculate the total possible moves for a specific board game based on the current state of the game.", "parameters": {"type": "dict", "properties": {"game_name": {"type": "string", "description": "Name of the board game."}, "current_state": {"type": "string", "description": "The current state of the board game, including pieces on the board and their positions."}, "include_repetitions": {"type": "boolean", "description": "Include repetitive moves in the count or not. Default is false."}}, "required": ["game_name", "current_state"]}}]}
{"id": "irrelevance_198", "question": [[{"role": "user", "content": "What are the rules of the game 'Uno'?"}]], "function": [{"name": "cards.shuffle_deck", "description": "Shuffles a deck of cards.", "parameters": {"type": "dict", "properties": {"deck": {"type": "string", "description": "The deck of cards to be shuffled."}, "times": {"type": "integer", "description": "The number of times to shuffle the deck."}, "deck_type": {"type": "string", "description": "The type of card deck. E.g. 'Poker', 'Uno'. Default is 'Poker'."}}, "required": ["deck", "times"]}}]}
{"id": "irrelevance_199", "question": [[{"role": "user", "content": "Who has the highest number of hearts in a game of poker?"}]], "function": [{"name": "play_poker", "description": "Deal the hand of poker.", "parameters": {"type": "dict", "properties": {"number_of_players": {"type": "integer", "description": "The number of players."}, "cards_per_player": {"type": "integer", "description": "The number of cards to be dealt to each player."}, "game_type": {"type": "string", "description": "Type of the poker game. Defaults to 'Texas Holdem'"}}, "required": ["number_of_players", "cards_per_player"]}}]}
Expand Down
Loading