MathCAMPS Logo

NuminaMath 7B TIR

Performance on individual Common Core standards, grouped by grade level.
IFUP Acc. = Incremental Followup Accuracy, CFUP Acc. = Counterfactual Followup Accuracy, Total FUPs Seen = Number of followup questions a model sees, since a model only sees followups if it answers the main question correctly.

Grade K

Standard Overall Acc. IFUP Acc. CFUP Acc. Total FUPs
K.OA.A.4 0.92 - - -
K.NBT.A.1 0.96 - - -
K.OA.A.5 0.88 0.90 0.96 119
K.CC.C.7 0.79 - 0.94 79

Grade 1

Standard Overall Acc. IFUP Acc. CFUP Acc. Total FUPs
1.OA.A.2 0.96 0.98 0.94 130
1.OA.A.1 0.97 0.85 0.95 139
1.OA.D.8 0.99 - - -

Grade 2

Standard Overall Acc. IFUP Acc. CFUP Acc. Total FUPs
2.NBT.B.7 0.93 0.80 0.96 143
2.MD.B.5 0.97 0.83 0.94 148
2.NBT.B.6 0.89 0.89 0.96 134
2.OA.A.1 0.95 - - -
2.NBT.B.5 0.97 0.88 0.94 133
2.MD.C.8 0.98 0.97 0.94 152

Grade 3

Standard Overall Acc. IFUP Acc. CFUP Acc. Total FUPs
3.OA.D.8 0.89 0.86 0.96 121
3.MD.D.8-quadrilateral 0.89 - - -
3.OA.A.4 0.95 - - -
3.MD.D.8-polygon 0.70 - - -
3.MD.D.8-triangle 0.92 - - -
3.NBT.A.2 0.95 0.88 0.92 147
3.OA.A.3 0.91 0.79 0.79 100
3.OA.C.7 0.96 0.89 0.92 122

Grade 4

Standard Overall Acc. IFUP Acc. CFUP Acc. Total FUPs
4.NBT.B.4 0.91 0.85 0.92 125
4.OA.B.4 0.57 - - -
4.MD.A.2-decimal 0.84 0.87 0.88 114
4.MD.A.3 0.94 - 0.86 80
4.MD.A.2-fraction 0.39 0.80 0.94 46
4.NBT.B.6 0.60 - 0.85 48
4.NF.A.2 0.90 - 0.87 90
4.NBT.B.5 0.85 - 0.87 78
4.OA.A.3 0.45 0.93 0.90 53

Grade 5

Standard Overall Acc. IFUP Acc. CFUP Acc. Total FUPs
5.OA.A.1 0.81 0.85 0.77 84
5.NBT.B.6 0.91 - 0.47 70
5.NF.A.1 0.25 0.54 0.83 48
5.NBT.B.5 0.65 1.00 0.88 58
5.NF.B.4 0.68 0.79 0.83 107
5.NBT.B.7 0.64 0.61 0.91 92
5.NF.A.2 0.45 0.52 0.52 71

Grade 6

Standard Overall Acc. IFUP Acc. CFUP Acc. Total FUPs
6.EE.B.7 0.97 - - -
6.NS.B.2 0.80 - 0.47 38
6.EE.A.1 1.00 - 0.99 97
6.NS.B.3 0.60 0.57 0.86 84

Grade 7

Standard Overall Acc. IFUP Acc. CFUP Acc. Total FUPs
7.NS.A.1-fraction 0.27 0.29 0.52 49
7.NS.A.2 0.75 0.80 0.82 132
7.NS.A.1-decimal 0.91 0.94 0.99 159
7.NS.A.3-fraction 0.25 0.33 0.59 31
7.NS.A.3-decimal 0.76 0.87 0.94 116

Grade 8

Standard Overall Acc. IFUP Acc. CFUP Acc. Total FUPs
8.EE.A.2 0.96 - - -
8.EE.C.8 0.00 - - -
8.EE.C.7 0.62 - - -