Abi Raja | 54b59c85d6 | add eval runner for text prompt | 2024-07-19 09:25:11 -04:00
Abi Raja | 9d11866143 | improve evals code | 2024-07-19 07:55:44 -04:00
Abi Raja | 9f732c4f5d | update max tokens for Claude Sonnet 3.5 to newly supported limit (8192) | 2024-07-15 18:51:22 -04:00
Abi Raja | 8e6a9c48f8 | support GPT-4o | 2024-05-13 15:24:47 -04:00
Abi Raja | a5fe0960d8 | support best of n evals | 2024-04-24 14:54:03 -04:00
Abi Raja | bb642b320e | improve evaluation docs and the way the model is passed into the evaluation script | 2024-04-11 10:52:25 -04:00
Abi Raja | bd407e51f9 | code clean up | 2024-03-04 14:58:01 -05:00
Abi Raja | c0e084aa86 | update evals | 2024-02-21 09:30:42 -05:00
Abi Raja | b8bce72d23 | organize evals code into the evals dir | 2024-01-08 17:38:34 -08:00