GitXplorerGitXplorer
e

hp-ai-benchmark

public
2 stars
0 forks
0 issues

Commits

List of commits on branch main.
Unverified
b53b8c65b3da1eb61279b583c514ba70388bba46

flash

eelitan committed a month ago
Unverified
276c1fb2cdb560c8e37b6296886a4a183cf51bbe

x

eelitan committed 2 months ago
Unverified
f2883d0887afd190ff2b7e00a82b906fbd8ab3f5

x

eelitan committed 2 months ago
Unverified
26c45a66606237a7cb0d84a6a5de176fe16ee8ab

x

eelitan committed 6 months ago
Unverified
de671c69cbd9e6f744f58bb0cbca7f449aab5e3f

x

eelitan committed 7 months ago
Unverified
a93f8b7c171b589d68e9967c3dac3731c1cab587

x

eelitan committed 7 months ago

README

The README file for this repository.

Högskoleprovet AI Benchmark

This repository contains a benchmark for AI models on the Swedish university admissions test, Högskoleprovet.

Results

Högskoleprovet 2024 Spring

Model Verbal Verbal Points Math Math Points Total
claude-3-5-sonnet-20241022 58/60 2.0 61/80 1.5 1.75
gemini-2.0-flash-exp 57/60 1.9 61/80 1.5 1.7
claude-3-5-sonnet-20240620 58/60 2.0 59/80 1.4 1.7
claude-3-5-haiku-20241022 58/60 2.0 - - -
gpt-4o 58/60 2.0 59/80 1.4 1.7
gpt-turbo 57/60 1.9 47/80 1.1 1.5
gpt-4o-mini 55/60 1.8 46/80 1.1 1.45
claude-3-opus-20240229 58/60 2.0 39/80 0.8 1.4
gemini-pro-vision 56/60 1.9 - - -
claude-3-sonnet-20240229 55/60 1.8 - - -
gemini-1.5-flash-latest 54/60 1.7 - - -
claude-3-haiku-20240307 53/60 1.7 - - -
gpt-3.5-turbo 51/60 1.6 - - -
llama3-70b 50/60 1.5 - - -
llama3-7b 31/60 0.8 - - -

Test

https://www.studera.nu/hogskoleprov/fpn/provfragor-och-facit-varen-2024/