Your two-column approach, while more descriptive, somehow seems to lack explanatory power to me. I don't think it would clear up confusion in most cases (but that's just my intuition). I wrote out the implied text, which your explanation doesn't include.
I suppose I didn't explain what the implied image is, but the meme format is well known at this point. Basically anyone can infer the missing images, which are generally the same from meme to meme; the missing text is the hard part to infer.