A simple path forward is to go from classifying single elements of the training data to classifying multiple elements and the relationships between them.
Slightly less simple is to gather orders of magnitude more data by just hooking the input up to an IRL robot.
Another step is for the NN to control the robot, decide which parts of the data require refinement, and focus on those.
There are still a lot of ways to improve data acquisition on the table; it isn't going to stop at creating large corpora and having humans fine-tune them.
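
To make the "NN decides which parts of the data require refinement" idea a bit more concrete, here is a minimal sketch of one way it could work: rank samples by how uncertain the model is about them and revisit the most uncertain ones first. This is ordinary entropy-based uncertainty sampling, swapped in purely for illustration; the function names and the example probabilities are made up, and nothing here involves an actual robot.

```python
import math

def entropy(probs):
    """Shannon entropy (in nats) of a discrete probability distribution."""
    return -sum(p * math.log(p) for p in probs if p > 0)

def select_for_refinement(predictions, k):
    """Return the indices of the k samples the model is least sure about.

    `predictions` is a list of per-sample class probability lists.
    Higher entropy is treated as "needs more data here".
    """
    ranked = sorted(
        range(len(predictions)),
        key=lambda i: entropy(predictions[i]),
        reverse=True,
    )
    return ranked[:k]

# Hypothetical model outputs for four observations (assumed values).
predictions = [
    [0.98, 0.01, 0.01],  # confident
    [0.40, 0.35, 0.25],  # fairly unsure
    [0.70, 0.20, 0.10],  # somewhat confident
    [0.34, 0.33, 0.33],  # almost a coin toss
]

# Indices of the two observations worth collecting more data about.
to_revisit = select_for_refinement(predictions, k=2)
print(to_revisit)
```

In a robot setting, the selected indices would presumably map to places or objects the robot goes back to look at again, but that part is left out of the sketch.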