alvaro ,

[question] Alt-text service, self hosted

Is there a service that I could plug into my Misskey/Firefish/whatever instance that provides alt-text for an image I'm posting? I don't post images that often, so the few times I do, it would be nice to have an automatic OCR transcription or an LLM/image-captioning description attached to the picture.

Of course, without depending on Google or similar services.

Any ideas? cc @selfhost @selfhosted

Deebster ,
I feel that auto-generated descriptions are generally going to be terrible, even with the new GPT AIs. Doing a good job needs too much context to be able to just feed an image into some code and get something useful.

On the other hand, transcriptions can probably be done more accurately, particularly with a bit of extra logic to recognise forms like Twitter posts.

Some database of alt-texts might be possible by scraping for alt-texts and transcriptions from the fediverse, reddit, etc, but a quick search didn't come up with anything.

alvaro OP ,

@Deebster maybe just OCR would be enough for a lot of cases

Deebster ,
Yeah, that's what I mean by transcriptions; if you're mostly posting screenshots of social media posts then it'd probably work quite well, but if it's photos you're definitely going to need something more complicated.
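For what it's worth, here is a rough sketch of the "OCR plus a bit of extra logic" idea for screenshots. It assumes Tesseract is installed locally along with the pytesseract and Pillow packages (so nothing leaves your machine). The function names, the "display name followed by @handle" heuristic, and the 1500-character cap are all made up for illustration, not taken from any existing service, and it won't help with photos.

```python
import re

# Assumption: a handle looks like "@name" on a line by itself.
HANDLE_RE = re.compile(r"^@[A-Za-z0-9_]{1,15}$")


def ocr_image(path):
    """Run local OCR on an image file.

    Assumes the Tesseract binary plus the pytesseract and Pillow
    packages are installed; imported lazily so the rest of this
    sketch works without them.
    """
    from PIL import Image
    import pytesseract
    return pytesseract.image_to_string(Image.open(path))


def build_alt_text(ocr_text, max_len=1500):
    """Turn raw OCR output into a one-line alt-text string.

    Heuristic (an assumption, not a standard): if one of the first
    few lines is a bare @handle, treat the line before it as the
    author's display name and everything after it as the post body.
    """
    lines = [ln.strip() for ln in ocr_text.splitlines() if ln.strip()]
    if not lines:
        return ""
    for i, ln in enumerate(lines[:4]):
        if i > 0 and HANDLE_RE.match(ln):
            author = lines[i - 1]
            body = " ".join(lines[i + 1:])
            text = f"Screenshot of a post by {author} ({ln}): {body}"
            break
    else:
        # No handle found: fall back to a plain transcription.
        text = "Text in image: " + " ".join(lines)
    return text[:max_len]  # arbitrary cap; adjust to your instance's limit
```

You would call it as `build_alt_text(ocr_image("screenshot.png"))` and paste the result into the image description field before posting.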
