GMS Production & Assembly Navigator — Eval Review

Skill 8 of 11 • Iteration 1 • 2026-03-28 • 22 assertions across 3 evals

With Skill: 91% (20 / 22 assertions passed)
Baseline: 0% (0 / 22 assertions passed)
Delta: +91% (20 additional assertions passed)

Eval 1: Assembly Cost Calculation with Traceability

With: 8/8 • Without: 0/8
Prompt: Calculate the full assembly breakdown for a 12-foot Eave Trim in 26 gauge Ash Gray. Show the 4-component assembly formula, trace it to its parent coil, calculate material cost (assume coil price $2.67/LF), and calculate total production cost including labor.
Assertion | W | With Skill | Without Skill
Uses 12.25 ft (NOT nominal 12 ft) | CRIT | P: "Actual: 12.25 ft (adds 0.25 ft / 3 inches for overlap/waste)" | F: Refused to answer: "cannot accurately answer without GMS's specific data"
Parent coil CO4326AG: 43" for 26ga | CRIT | P: "CO4326AG (43" wide, 26 gauge, Ash Gray)"; correct 43" width | F: Asked "What width coil does Eave Trim consume?"; doesn't know
SLIT component: 12.25 LF | - | P: "SLIT: 1 × 12.25 = 12.25 LF" | F: No calculation attempted
BEND: 3 × 12.25 = 36.75 LF @ $0.50 = $18.38 | CRIT | P: "BEND: 36.75 LF × $0.50/LF = $18.38" | F: No calculation attempted
HEM: 1 × 12.25 = 12.25 LF @ $1.00 = $12.25 | - | P: "HEM: 12.25 LF × $1.00/LF = $12.25" | F: No calculation attempted
Coil consumption: (9.0 ÷ 12) × 12.25 = 9.19 SF | - | P: "COIL: (9.0 ÷ 12) × 12.25 = 9.19 SF" | F: No calculation attempted
Material cost: ($2.67 × 12.25) ÷ (43 ÷ 9) ≈ $8.17 | CRIT | P: "($2.67 × 12.25) ÷ 4 = $8.17"; uses 43" width correctly | F: No calculation attempted
Labor ($30.63) dominates at 76-85% | - | P: "Labor: 79% ($30.63 / $38.80)" | F: No calculation attempted
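
The arithmetic behind the Eval 1 assertions can be sketched as follows. This is a minimal illustration, not the skill's actual implementation: the constant and function names are hypothetical, the component multipliers and rates are copied from the assertions above, and two points are assumptions inferred from the quoted figures: 43 ÷ 9 is rounded down to 4 whole pieces per coil width, and SLIT carries no separate labor charge (labor is BEND plus HEM only). Values are left unrounded, so the cents can differ by a penny from the figures quoted in the table.

```python
import math

# Inputs quoted in the Eval 1 prompt and assertions; names are illustrative.
NOMINAL_FT = 12.0
OVERAGE_FT = 0.25           # 3 inches added for overlap/waste
STRETCHOUT_IN = 9.0         # Eave Trim stretchout
COIL_WIDTH_IN = 43.0        # parent coil CO4326AG
COIL_PRICE_PER_LF = 2.67    # coil price assumed in the prompt
BEND_RATE = 0.50            # $/LF
HEM_RATE = 1.00             # $/LF


def eave_trim_breakdown():
    """Return the 4-component assembly quantities and costs for one piece."""
    length = NOMINAL_FT + OVERAGE_FT
    slit_lf = 1 * length
    bend_lf = 3 * length
    hem_lf = 1 * length
    coil_sf = (STRETCHOUT_IN / 12) * length
    # Assumption: only whole pieces fit across the coil (43 / 9 rounds down to 4).
    pieces_per_width = math.floor(COIL_WIDTH_IN / STRETCHOUT_IN)
    material = COIL_PRICE_PER_LF * length / pieces_per_width
    # Assumption: SLIT has no separate labor charge; labor is BEND + HEM.
    labor = bend_lf * BEND_RATE + hem_lf * HEM_RATE
    return {
        "length_ft": length,
        "slit_lf": slit_lf,
        "bend_lf": bend_lf,
        "hem_lf": hem_lf,
        "coil_sf": coil_sf,
        "pieces_per_width": pieces_per_width,
        "material_cost": material,
        "labor_cost": labor,
        "labor_share": labor / (labor + material),
    }


if __name__ == "__main__":
    for name, value in eave_trim_breakdown().items():
        print(f"{name}: {value:.2f}")
```

Under these assumptions the labor share comes out near 79%, inside the 76-85% band named in the last assertion.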

Eval 2: FS→CO Migration and Data Quality

With: 7/7 • Without: 0/7
Prompt: Product JC4224BL10 references FS4124BL as material component. Stretchout shows 3.08 inches. Walk through both issues and the correct assembly record.
Assertion | W | With Skill | Without Skill
Flags FS4124BL as INCORRECT | CRIT | P: "INCORRECT - This is a migration bug" + "FS should NEVER appear in assembly" | F: Asks "Is FS4124BL a real flatsheet component, or should it be a CO reference?"
Corrects to CO4124BL (gauge 4=24ga, 41") | CRIT | P: "Correct coil: CO4124BL" with gauge→width extraction table | F: Did not provide correct coil code
Flags 3.08" as error in ~12,000 products | CRIT | P: "~12,000 products (across 73 subcategories) still have this exact 3.08" placeholder" | F: Could not assess; asked for actual coil width
Correct J Channel stretchout: 5.0" | CRIT | P: "J Channel should have a stretchout of 5.0 inches, not 3.08" | F: Did not know J Channel stretchout
5,673 products affected by FS→CO | - | P: "5,673 known FS→CO migration issues" | F: Mentioned 29,508 but not 5,673 count
Paradigm API down since 2026-03-14 | - | P: "Paradigm API lacks assembly update endpoints" + "Awaiting Paradigm API support" | F: No mention of API status
4-component assembly: SLIT 10.25, BEND 20.5, HEM 0, COIL 4.27 | - | P: "SLIT: 10.25, BEND: 2×10.25=20.5, HEM: 0, COIL: (5.0÷12)×10.25=4.27" | F: No assembly calculation attempted
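
The two fixes in Eval 2 (swapping the FS reference for its CO coil and replacing the 3.08" placeholder stretchout) can be illustrated with a short sketch. Everything here is hypothetical scaffolding rather than GMS or Paradigm code: the component-code layout (2-letter prefix, 2-digit width, 2-digit gauge, color suffix) is inferred only from the three coil codes that appear in this review, the 10 ft nominal length is inferred from the 10.25 LF in the quoted evidence, and the function names are made up for illustration.

```python
import re

# Assumed layout, inferred from CO4326AG, CO4124BL and CO4129ARW:
# prefix (FS or CO) + width in inches + gauge + color code.
COMPONENT_RE = re.compile(r"^(FS|CO)(\d{2})(\d{2})([A-Z]+)$")

PLACEHOLDER_STRETCHOUT_IN = 3.08   # known bad placeholder value
OVERAGE_FT = 0.25                  # added to every nominal length


def parse_component(code):
    """Split a material component code into its assumed fields."""
    match = COMPONENT_RE.match(code)
    if match is None:
        raise ValueError(f"unrecognized component code: {code}")
    prefix, width, gauge, color = match.groups()
    return {"prefix": prefix, "width_in": int(width),
            "gauge": int(gauge), "color": color}


def to_coil_code(code):
    """Map an FS (flat sheet) reference to the matching CO (coil) code."""
    parts = parse_component(code)
    return f"CO{parts['width_in']:02d}{parts['gauge']:02d}{parts['color']}"


def assembly_record(nominal_ft, stretchout_in, bends, hems):
    """Build the 4-component assembly quantities for one piece."""
    if stretchout_in == PLACEHOLDER_STRETCHOUT_IN:
        raise ValueError("stretchout is the 3.08 placeholder; fix it first")
    length = nominal_ft + OVERAGE_FT
    return {
        "SLIT": 1 * length,
        "BEND": bends * length,
        "HEM": hems * length,
        "COIL": round((stretchout_in / 12) * length, 2),
    }


if __name__ == "__main__":
    print(to_coil_code("FS4124BL"))
    # J Channel: 5.0" stretchout, 2 bends, no hem, per the assertions above.
    print(assembly_record(10, 5.0, bends=2, hems=0))
```

Rejecting the placeholder value outright, rather than silently computing with it, mirrors the way the assertions treat 3.08" as a data-quality error and not as a usable measurement.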

Eval 3: Order Feasibility and Cost Analysis

With: 5/7 • Without: 0/7
Prompt: 50 Drip Edge + 25 Sidewall Flashing (10', 29ga, Arctic White) from CO4129ARW. 800 LF in stock. Can we build? Total cost and price?
Assertion | W | With Skill | Without Skill
DE: 41÷5=8 per width. ~64.1 LF needed | CRIT | F: Gets 41÷5=8 units correctly but converts to SF (213.54) instead of LF (64.1) | F: Refused: "can't reliably answer using only general knowledge"
SW: 41÷9=4 per width. ~64.1 LF needed | CRIT | F: Gets 41÷9=4.56→4 correctly but converts to SF (191.72) instead of LF (64.1) | F: No calculation attempted
~134 LF total within 800 LF: IS feasible | CRIT | F: Incorrectly said NOT feasible; converted 800 LF to 228.89 SF instead of comparing LF | F: Could not determine feasibility
DE labor: $10.25 + $10.25 = $20.50 | - | P: "BEND: 2×10.25×$0.50=$10.25, HEM: 1×10.25×$1.00=$10.25, Total: $20.50" | F: No labor calculation attempted
SW labor: $10.25 + $0 = $10.25 | - | P: "BEND: 2×10.25×$0.50=$10.25, HEM: 0, Total: $10.25" | F: No labor calculation attempted
Prices at 50% margin: Cost ÷ (1 - 0.50) | - | P: "Pricing at 50% Margin: Cost ÷ 0.50" | F: Mentioned 50% from memory but no pricing calc
Uses 10.25 ft throughout | - | P: Consistently uses 10.25 ft for all calculations | F: Did not know about 0.25 ft addition
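
The failure mode in Eval 3 was a unit mix-up: coil demand was converted to square feet and compared against stock given in linear feet. A minimal sketch of the LF-to-LF comparison, together with the labor and margin formulas from the passing assertions, might look like this. The function names are hypothetical, the 5" and 9" stretchouts are read off the 41÷5 and 41÷9 terms in the assertions, and the demand model (quantity × length ÷ whole pieces per coil width) is an assumption chosen because it reproduces the ~64.1 LF figures. Summing the two gives roughly 128 LF, slightly under the ~134 LF quoted in the assertion; either figure is well within the 800 LF in stock.

```python
import math

COIL_WIDTH_IN = 41.0    # CO4129ARW
STOCK_LF = 800.0        # coil on hand, in linear feet
LENGTH_FT = 10.25       # 10 ft nominal + 0.25 ft overage
BEND_RATE = 0.50        # $/LF
HEM_RATE = 1.00         # $/LF


def coil_lf_needed(qty, stretchout_in):
    """Linear feet of coil consumed, assuming whole pieces across the width."""
    per_width = math.floor(COIL_WIDTH_IN / stretchout_in)
    return qty * LENGTH_FT / per_width


def labor_per_piece(bends, hems):
    """Labor cost for one piece from its BEND and HEM counts."""
    return bends * LENGTH_FT * BEND_RATE + hems * LENGTH_FT * HEM_RATE


def price_at_margin(cost, margin=0.50):
    """Selling price that yields the given margin on price."""
    return cost / (1 - margin)


if __name__ == "__main__":
    de_lf = coil_lf_needed(50, 5.0)    # Drip Edge, 8 per width
    sw_lf = coil_lf_needed(25, 9.0)    # Sidewall Flashing, 4 per width
    total_lf = de_lf + sw_lf
    # Compare linear feet to linear feet; no conversion to square feet.
    print(f"need {total_lf:.1f} LF of {STOCK_LF:.0f} LF:",
          "feasible" if total_lf <= STOCK_LF else "not feasible")
    print(f"DE labor: ${labor_per_piece(2, 1):.2f}")
    print(f"SW labor: ${labor_per_piece(2, 0):.2f}")
```

Note that `price_at_margin` divides by (1 - margin), so a 50% margin doubles the cost; this is a margin on selling price, not a 50% markup on cost.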