Put the mouse-face on the entire xref, like the local keymap.
Attach a file by drag & drop or click to upload