The ATmega328P datasheet says:
each register is also assigned a data memory address, mapping them directly into the first 32 locations of the user data space.
Which suggests I can either use `r1` in my assembly file or load/store to data address `0x1`? They both accomplish the same thing?
If every instruction is also executed in 1 clock cycle then does it even make a difference in performance?