The 6809 can really move some data if you disable interrupts and use a run of PULS/PSHU pairs that copy 6 or 7 bytes at a time (the routine below uses 6- and 5-byte pairs). It's not quite as good as the Z-80's LDIR, but it's a lot better than anything the 6502 can do. This is from a graphics-mode screen driver that I wrote for OS-9:
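SCRUP   PSHS  U,Y,D       ; save caller's registers (U ends up at 4,S)
        LDX   #0          ; row 0
        BSR   SCRADR      ; SCRADR (not shown) presumably returns that row's address in Y
        LEAU  6,Y         ; U = destination + 6, because PSHU pre-decrements
        LEAX  $1800,Y     ; X = destination + $1800 (overwritten by the first PULS below)
        CLR   ENDSCR      ; counter = 0, so the DEC/BNE below makes 256 passes
        STS   SPSAV       ; save the real stack pointer
        LEAS  256,Y       ; S = source = destination + 256; interrupts must stay off until LDS
SCRUP1  PULS  D,X,Y       ; read 6 bytes from the source (S += 6)
        PSHU  Y,X,D       ; write them in the same order (U -= 6)
        LEAU  12,U        ; skip the 6 bytes just written plus the 6 to come
        PULS  D,X,Y
        PSHU  Y,X,D
        LEAU  12,U
        PULS  D,X,Y
        PSHU  Y,X,D
        LEAU  11,U        ; 6 just written plus the 5 to come
        PULS  A,X,Y       ; read 5 bytes
        PSHU  Y,X,A       ; write 5 bytes
        LEAU  11,U        ; 5 just written plus the 6 at the top of the loop
        DEC   ENDSCR      ; 23 bytes per pass
        BNE   SCRUP1      ; 256 passes x 23 bytes = 5888 ($1700) bytes moved
        LEAY  -256,S      ; S is at the end of the source; Y = start of the last 256 bytes
        LDS   SPSAV       ; restore the real stack pointer
        LDU   4,S         ; reload the caller's U, which the copy used as a pointer
        LBSR  CLRLN0      ; CLRLN0 (not shown) presumably clears the row at Y
        PULS  D,Y,U,PC    ; restore registers and return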
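The listing itself doesn't show the interrupt masking mentioned at the top. A minimal sketch of how it could bracket the stretch where S points into screen memory, assuming nothing else in the system needs IRQ or FIRQ serviced during the copy (NMI can't be masked this way); the placement is my assumption, not part of the original driver:

```asm
        PSHS  CC          ; save the caller's condition codes on the real stack
        ORCC  #$50        ; set the F and I bits: mask FIRQ and IRQ
        STS   SPSAV       ; from here on S is a data pointer, not a stack
* ... LEAS 256,Y and the SCRUP1 copy loop go here ...
        LDS   SPSAV       ; S is a real stack pointer again
        PULS  CC          ; restore the caller's interrupt mask
```

Without the mask, an IRQ would stack 12 bytes of register state just below S (3 bytes for an FIRQ), which at that point means overwriting screen data.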
...of course if you have a 6309 you can use the TFM instruction instead.
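A rough sketch of what the 6309 version of the copy loop might look like, assuming Y already holds the destination address (as after the SCRADR call above) and the same $1800-byte screen scrolled by 256 bytes. This is an illustration, not code from the original driver:

```asm
        LEAX  256,Y       ; X = source, 256 bytes past the destination
        LDW   #$1700      ; W = byte count: $1800 - $100 = 5888
        TFM   X+,Y+       ; copy W bytes from X to Y, incrementing both
```

TFM leaves S alone, so the real stack stays usable throughout, and Y finishes at the start of the last 256 bytes, which is the same address the 6809 routine hands to CLRLN0.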