Welcome to Incremental Social! Learn more about this project here!
Check out lemmyverse to find more communities to join from here!

BetaDoggo_

@BetaDoggo_@lemmy.world

This profile is from a federated server and may be incomplete. Browse more on the original instance.

BetaDoggo_ ,

What's the deal with Alpine not using GNU? Is it a technical or ideological thing? Or is it another "because we can" type distro?

BetaDoggo_ ,

The model does have a lot of advantages over sdxl with the right prompting, but it seems to fall apart in prompts with more complex anatomy. Hopefully the community can fix it up once we have working trainers.

BetaDoggo_ ,

The names missing from the list say more about the board's purpose than the names on it.

BetaDoggo_ ,

All of Firefox's ai initiatives including translation and chat are completely local. They have no impact on privacy.

BetaDoggo_ ,

The issue is that they have no way of verifying that. We'd have to trust 2 other companies in addition to DDG.

BetaDoggo_ ,

The "why would they make this" people don't understand how important this type of research is. It's important to show what's possible so that we can be ready for it. There are many bad actors already pursuing similar tools if they don't have them already. The worst case is being blindsided by something not seen before.

BetaDoggo_ ,

The 8B is incredible for it's size and they've managed to do sane refusal training this time for the official instruct.

BetaDoggo_ ,

They're already lying to get passed the 13 year requirement so I doubt it would make any difference.

BetaDoggo_ ,

I'm sure the machine running it was quite warm actually.

BetaDoggo_ ,

Partnered with Adobe research so we're never going to get the actual model.

BetaDoggo_ ,

This has more to do with how much chess data was fed into the model than any kind of reasoning ability. A 50M model can learn to play at 1500 elo with enough training: https://adamkarvonen.github.io/machine_learning/2024/01/03/chess-world-models.html

BetaDoggo_ ,

Do another 2 day blackout. That'll show 'em.

BetaDoggo_ ,

The "AI PC" specification requires a minimum of 40TOPs of AI compute which is over double the 18TOPs in the current M3s. Direct comparison doesn't really work though.

What really matters is how it's made available for development. The Neural engine is basically a black box. It can't be incorporated into any low level projects because it's only made available through a high-level swift api. Intel by comparison seems to be targeting pytorch acceleration with their libraries.

BetaDoggo_ ,

This article is grossly overstating the findings of the paper. It's true that bad generated data hurts model performance, but that's true of bad human data as well. The paper used opt125M as their generator model, a very small research model with fairly low quality and often incoherent outputs. The higher quality generated data which makes up a majority of the generated text online is far less of an issue. The use of generated data to improve output consistency is a common practice for both text and image models.

BetaDoggo_ ,

It's size makes it basically useless. It underperforms models even in it's active weight class. It's nice that it's available but Grok-0 would have been far more interesting.

BetaDoggo_ , (edited )

I feel like the whole Reddit AI deal is a trap. If any real judgment comes down about data use Reddit is an easy scapegoat. There was basically nothing stopping them from scraping the site for free.

BetaDoggo_ ,

I got locked out of my now 8+ year old account because I had set it up with an old ISP provided email which has since been deactivated. I can't migrate because I have to verify with the email and I can't change the email without setting up security questions, which also requires the email. Support can do nothing.

Midjourney Accuses Stability AI of Image Theft, Bans Its Employees (80.lv)

According to a recent tweet shared by AI enthusiast Nick St. Pierre, the alleged theft occurred last Saturday. It is claimed that employees from Stability AI infiltrated Midjourney's database and stole all prompt and image pairs, an action that also caused a 24-hour outage. In response, MJ reportedly banned all Stable Diffusion...

BetaDoggo_ ,

I don't think they care about the images being used, just the disruption of service. It's pretty clear that this wasn't a coordinated thing from Stability and was at most a lone individual acting in bad faith.

It's pretty ironic though that the company that practices mass scraping has no rate limits to prevent outages due to mass scraping.

BetaDoggo_ , (edited )

According to the article:

They are asking a federal judge to say yes to this, specifically:

Developing or distributing software, including Yuzu, that in its ordinary course functions only when cryptographic keys are integrated without authorization, violates the Digital Millennium Copyright Act’s prohibition on trafficking in devices that circumvent effective technological measures, because the software is primarily designed for the purpose of circumventing technological measures.

So I think they're definitely intending to set precedent with this case, though this settlement hasn't been accepted by the court yet.

BetaDoggo_ ,

USB-C display output uses the Display Port protocol

BetaDoggo_ ,

VESA or bust

BetaDoggo_ ,

I believe USB-C is the only connector supported for carrying DisplayPort signals other than DisplayPort itself.

The biggest issue with USB-C for display in my opinion is that cable specs vary so much. A cable with a type c end could carry anywhere from 60-10000MB/s and deliver anywhere from 5-240W. What's worse is that most aren't labeled, so even if you know what spec you need you're going to have a hell of a time finding it in a pile of identical black cables.

Not that I dislike USB-C. It's a great connector, but the branding of USB has always been a mess.

BetaDoggo_ ,

This looks like an ad. They go on about what their proprietary detection method found without any details about how it came to these conclusions or even how they generated the test data. They give 0 actual examples for any of their claims.

Here's the original blog post the article is referencing: https://copyleaks.com/blog/copyleaks-ai-plagiarism-analysis-report

BetaDoggo_ ,

Koboldcpp should allow you to run much larger models with a little bit of ram offloading. There's a fork that supports rocm for AMD cards: https://github.com/YellowRoseCx/koboldcpp-rocm

Make sure to use quantized models for the best performace, q4k_M being the standard.

Reddit is a ‘smaller, more volatile’ Twitter, says Big Technology’s Alex Kantrowitz (www.cnbc.com)

Reddit is a ‘smaller, more volatile’ Twitter, says Big Technology’s Alex Kantrowitz::Alex Kantrowitz, Big Technology founder, joins 'Squawk Box' to discuss Reddit's decision to go public, the company's journey to IPO, Sam Altman's stake in the company, and more.

BetaDoggo_ ,

I doubt any platform could be more volitile than Twitter with Musk at the helm.

BetaDoggo_ ,

Who's dumb enough to pay for that? Everyone else is just scraping it for free.

BetaDoggo_ ,

I'm looking forward to reading the paper

You mean the 100 page technical report

“In 10 years, computers will be doing this a million times faster.” The head of Nvidia does not believe that there is a need to invest trillions of dollars in the production of chips for AI (gadgettendency.com)

“In 10 years, computers will be doing this a million times faster.” The head of Nvidia does not believe that there is a need to invest trillions of dollars in the production of chips for AI::Despite the fact that Nvidia is now almost the main beneficiary of the growing interest in AI, the head of the company, Jensen Huang,...

BetaDoggo_ , (edited )

This isn't necessarily about just hardware. Current ML architectures and inference engines are far from being at peak efficiency. Just last year we saw 20x speedups for llm inference on some hardware. "a million times" is obviously hyperpole though.

BetaDoggo_ ,

This is why you should always selfhost your AI girlfriend.

BetaDoggo_ ,

This is an article about another article, some top tier journalism. They're right about the external display though. I've yet to see a positive comment about it, seems like just a weird gimmick that drains the already short battery life.

Because AI and Crypto use so much electricity, what if a law was made that they had to power it with green energy?

Something on the lines of if your company facility is using over X amount of energy the majority of that has to be from a green source such as solar power. What would happen and is this feasible or am I totally thinking about this wrong...

BetaDoggo_ ,

There is no such thing as "green" energy, all energy has an environmental extraction/capture cost. Crypto has insane per user power usage, AI isn't quite as bad but it's still much higher than normal websearch. Both should be used sparingly in cases where they actually make sense.

OpenAI wants to raise 5-7 trillion dollars. Yes, Trillion (decrypt.co)

OpenAI CEO Sam Altman is in talks with investors, including from the United Arab Emirates, to raise between $5 trillion to $7 trillion in funding. The goal, according to a report in The Wall Street Journal, is to increase the world's chip manufacturing capacity and enhance AI capabilities....

BetaDoggo_ ,

Just a few more parameters, then the text prediction model will become sentient.

BetaDoggo_ ,

The fact that you can't buy the cable needed to unbrick a Chromebook, and have to solder it together yourself from Google's schematics is ridiculous.

Weaver: New Specialised Writing LLMs Outperform GPT-4 (arxiv.org)

Weaver introduces a new family of specialised large language models tailored for creative and professional writing. Offering models ranging from 1.8B to 34B parameters, said to outperform larger generalist models like GPT-4 by focusing on human-like text production and diverse content creation capabilities.

BetaDoggo_ ,

Seems kind of like phi but for writing, the smaller ones are trained with 50B tokens and the largest is only trained with 18B.

BetaDoggo_ ,

It's not private data if you publish it online.

They already had this data, I'm not sure why anyone cares about what they're doing with it now. It's not any worse than selling it outright.

BetaDoggo_ ,

Anywhere speculative investment is involved there are cult like patterns. If your investors don't believe that your product is going to revolutionize its field you're not going to get the kind of funding these startups want.

BetaDoggo_ ,

So the solution is to just not do that.

BetaDoggo_ , (edited )

The definition of understanding they use is very shallow compared to how most would define it. Failure to complete a task consistently when numbers are changed, even when they don't effect the answer shows a lack of real understanding to most. Asking a model the sheet drying question for example will give different results depending on what numbers you use. Better models are better at generalizing but are still far from demonstrating what most consider to be real understanding.

BetaDoggo_ ,

A language model can't determine good from bad because it's only trained to predict the next token based on what it has seen.

BetaDoggo_ ,

The article says "already" like this is the result of something new and not the machine translation we've had for well over a decade.

BetaDoggo_ ,

Tech companies say they are putting in place systems to prevent AI being used for criminal or other malign purposes, and insist the new technology will create more jobs than it destroys

Automation doesn't create more jobs than it replaces, if it did there would be no point. In some cases it removes bottlenecks which allows for greater scale which in turn creates jobs, but these new jobs often require different skillsets from the ones displaced.

You're not really allowed to criticize them for this when your economic system encourages this exact behavior.

BetaDoggo_ , (edited )

This isn't shocking at all. The markets for obscure language content are incredibly small so there's no incentive for most to spend resources on it. I'd argue mediocre machine translation is better than nothing at all in many cases, but for unsupervised training it does pose a challenge.

BetaDoggo_ ,

Flatpak is good for diversity. Users don't need to worry about whether the obscure distro they want to use has the software they want in its repos. If a distro supports flatpak it will work with most popular software out of the box.

BetaDoggo_ ,

Smaller communities aren't necessarily a bad thing. Compared to reddit I rarely feel like I'm commenting into the void.

BetaDoggo_ ,

Crypto hasn't been decentralized for a very long time. For most coins large miners/pools have control over the transactions. Proof of stake is far worse. Most coins are traded via centralized exchanges as doing anything on chain is slow and often has high fees.

Blockchain in its current form is not the answer for decentralized value exchange. I'm not sure if a proper decentralized currency can even work in practice. Currencies must be backed by something to have a stable value. In nearly all cases you need a central entity that will guarantee a currency's value.

  • All
  • Subscribed
  • Moderated
  • Favorites
  • random
  • incremental_games
  • meta
  • All magazines