AI-generated code compiles. The UI doesn't always match.
Cursor, Claude Code, and v0 generate working code fast. But LLMs take shortcuts, drop visual context as prompts grow, and prioritise logic over precision. The result compiles. It just doesn't always look like the design.
Spotting that drift manually means opening Figma and the browser side by side, eyeballing type sizes, and toggling between tabs to check spacing values. Every iteration puts you back at the start of that process.
Generation and verification are two different problems. Most tools only solve the first.