Some exciting projects from the last months:
- 3d scene reconstruction from a few images: https://dust3r.europe.naverlabs.com/
- gaussian avatars: https://shenhanqian.github.io/gaussian-avatars
- relightable gaussian codec: https://shunsukesaito.github.io/rgca/
- track anything: https://co-tracker.github.io/ https://omnimotion.github.io/
- segment anything: https://github.com/facebookresearch/segment-anything
- good human pose estimate models: (Yolov8, Google's mediapipe models)
- realistic TTS: https://huggingface.co/coqui/XTTS-v2, bark TTS (hit or miss)
- open great STT (mostly whisper based)
- machine translation (ex: seamlessm4t from meta)
It's crazy to see how much is coming out of Meta's R&D alone.
They have the money...
and data
Hundreds of thousands of H100s…
And a dystopian vision for the future that can make profitable use of the above ...
On the plus side, people make up the organization and when they eventually grow fed up with the dystopia, they leave with their acquired knowledge and make their own thing. So dystopias aren't stable in the long term.
The Ones Who Walk Away From O-Meta-s
A very apt reference to the story
The ones who walk away from Omelas
Dunno how pasting a link works but here it is:
https://shsdavisapes.pbworks.com/f/Omelas.pdf
I feel vaguely annoyed, I think it's because it took a lot of time to read through that, and it amounts to "bad to put child in solitary confinement to keep whole society happy."
What does a simplistic moral set piece about the abhorrence of sacrificing the good of one for the good of many have to do with (check notes) Facebook? Even as vague hand-wavey criticism, wouldn't Facebook would be the inverse?
For some people this is a stable dystopia.
Unless they think to hire new people.
That seems to rely on the assumption that human input is required to keep the dystopia going. Maybe I watched too much sci-fi, but the more pessimistic view is that the AI dystopia will be self-sustaining and couldn't be overcome without the concerted use of force by humans. But we humans aren't that good in even agreeing on common goals, let alone exerting continuous effort to achieve them. And most likely, by the time we start to even think of organizing, the AI dystopia will be conducting effective psychological warfare (using social media bots etc.) to pit us against each other even more.
So the dystopia spreads out... Metastasis
and (rumours say) engineers who will bail if Meta doesn’t let them open source
Whoa, Bark got a major update recently. Thanks for the link as a reminder to check in on that project!
Can you share what update you are referring to ?
I've played with Bark quite extensively a few month ago and I'm on the fence regarding that model: when it works, it's the best, but I found it to be pretty useless for most use-case I want to use TTS for because of the high rate of bad or weird output.
I'm pretty happy with XTTv2 though. It's reliable and output quality is still pretty good.
Not sure how relevant this is but note that Coqui TTS (the realistic TTS) has already shut down
https://coqui.ai
- streaming and rendering 3d movies in real-time using 4d gaussian splatting https://guanjunwu.github.io/4dgs/