Evaluations/Benchmark gpt-4o-mini "Prompt + Unit Test → Code"
main
cargo_test_passed_eval.parquet
texttext
MetaMeta/Llama 3.1 8B Instruct
Fireworks AI Fireworks AI
code
You are a pragmatic Rust programmer who enjoys test driven development. Given the following question and unit tests, write a Rust function to complete the task. Make the code simple and easy to understand. The code should pass `cargo build` and `cargo clippy`. Do not add a main function. Try to limit library usage to the standard library std. Respond with only the Rust function and nothing else. Be careful with your types, and try to limit yourself to the basic built in types and standard library functions. When writing the function you can think through how to solve the problem and perform reasoning in the comments above the function.

An example output should look like the following:

```rust
/// Reasoning goes here
/// and can be multi-line
fn add_nums(x: i32, y: i32) -> i32 {
  x + y
}
```

Make sure to only respond with a single  ```rust``` block.

Question:
{rust_prompt}

Unit Tests:
{rust_test_list}

Code:
Mar 1, 2025, 10:25 PM UTC
Mar 1, 2025, 10:25 PM UTC
5 row sample
3418 tokens$ 0.0007
5 rows processed, 3418 tokens used ($0.0007)
Estimated cost for all 500 rows: $0.0684
Sample Results completed
5 columns, 1-5 of 500 rows