I don't think open-sourcing matters too much here: the datasets are all open source, and the technique itself is published and well documented. OpenAI's advantage is that they can burn the $10 million in processor energy that it costs to train each new GPT model.
Edit: up through GPT-3, that is. (GPT-4 is worse, and I don't know why people would want it.)