Commit Graph

9 Commits

| Author | SHA1 | Message | Date |
|---|---|---|---|
| Abi Raja | 54b59c85d6 | add eval runner for text prompt | 2024-07-19 09:25:11 -04:00 |
| Abi Raja | 9d11866143 | improve evals code | 2024-07-19 07:55:44 -04:00 |
| Abi Raja | 9f732c4f5d | update max tokens for Claude Sonnet 3.5 to newly supported limit (8192) | 2024-07-15 18:51:22 -04:00 |
| Abi Raja | 8e6a9c48f8 | support GPT-4o | 2024-05-13 15:24:47 -04:00 |
| Abi Raja | a5fe0960d8 | support best of n evals | 2024-04-24 14:54:03 -04:00 |
| Abi Raja | bb642b320e | improve evaluation docs and the way the model is passed into the evaluation script | 2024-04-11 10:52:25 -04:00 |
| Abi Raja | bd407e51f9 | code clean up | 2024-03-04 14:58:01 -05:00 |
| Abi Raja | c0e084aa86 | update evals | 2024-02-21 09:30:42 -05:00 |
| Abi Raja | b8bce72d23 | organize evals code into the evals dir | 2024-01-08 17:38:34 -08:00 |