MathCAMPS Logo

GPT-3.5 Turbo

Performance on individual Common Core standards, grouped by grade level.
IFUP Acc. = Incremental Followup Accuracy, CFUP Acc. = Counterfactual Followup Accuracy, Total FUPs Seen = Number of followup questions a model sees, since a model only sees followups if it answers the main question correctly.

Grade K

Standard Overall Acc. IFUP Acc. CFUP Acc. Total FUPs
K.OA.A.4 0.97 - - -
K.NBT.A.1 0.95 - - -
K.OA.A.5 0.93 0.98 1.00 124
K.CC.C.7 0.99 - 1.00 99

Grade 1

Standard Overall Acc. IFUP Acc. CFUP Acc. Total FUPs
1.OA.A.2 0.96 1.00 1.00 129
1.OA.A.1 0.99 0.92 0.99 140
1.OA.D.8 1.00 - - -

Grade 2

Standard Overall Acc. IFUP Acc. CFUP Acc. Total FUPs
2.NBT.B.7 0.98 0.97 0.96 153
2.MD.B.5 0.99 0.94 0.95 150
2.NBT.B.6 0.98 0.95 0.96 145
2.OA.A.1 0.97 - - -
2.NBT.B.5 0.97 0.90 0.98 134
2.MD.C.8 0.98 1.00 0.99 153

Grade 3

Standard Overall Acc. IFUP Acc. CFUP Acc. Total FUPs
3.OA.D.8 0.93 0.92 0.97 123
3.MD.D.8-quadrilateral 0.99 - - -
3.OA.A.4 0.97 - - -
3.MD.D.8-polygon 0.95 - - -
3.MD.D.8-triangle 0.99 - - -
3.NBT.A.2 1.00 0.92 1.00 155
3.OA.A.3 0.95 0.97 0.72 106
3.OA.C.7 0.98 0.97 0.82 127

Grade 4

Standard Overall Acc. IFUP Acc. CFUP Acc. Total FUPs
4.NBT.B.4 1.00 0.88 0.96 136
4.OA.B.4 0.98 - - -
4.MD.A.2-decimal 0.93 0.92 0.90 127
4.MD.A.3 1.00 - 1.00 86
4.MD.A.2-fraction 0.50 0.63 0.85 60
4.NBT.B.6 0.79 - 0.82 66
4.NF.A.2 0.89 - 0.88 89
4.NBT.B.5 0.95 - 0.92 88
4.OA.A.3 0.68 0.85 0.91 74

Grade 5

Standard Overall Acc. IFUP Acc. CFUP Acc. Total FUPs
5.OA.A.1 0.96 0.75 0.77 102
5.NBT.B.6 1.00 - 0.04 75
5.NF.A.1 0.42 0.35 0.81 74
5.NBT.B.5 0.81 1.00 0.90 72
5.NF.B.4 0.78 0.86 0.90 117
5.NBT.B.7 0.70 0.64 0.97 101
5.NF.A.2 0.73 0.77 0.81 120

Grade 6

Standard Overall Acc. IFUP Acc. CFUP Acc. Total FUPs
6.EE.B.7 1.00 - - -
6.NS.B.2 0.97 - 0.11 47
6.EE.A.1 0.92 - 0.97 89
6.NS.B.3 0.71 0.66 0.87 98

Grade 7

Standard Overall Acc. IFUP Acc. CFUP Acc. Total FUPs
7.NS.A.1-fraction 0.66 0.41 0.70 124
7.NS.A.2 0.89 0.74 0.87 155
7.NS.A.1-decimal 0.99 0.98 0.99 175
7.NS.A.3-fraction 0.37 0.83 0.82 45
7.NS.A.3-decimal 0.92 0.85 0.96 133

Grade 8

Standard Overall Acc. IFUP Acc. CFUP Acc. Total FUPs
8.EE.A.2 0.99 - - -
8.EE.C.8 0.03 - - -
8.EE.C.7 0.66 - - -