Just figured out there are 10 places called Lisbon dotted around the US, according to the search.
Just figured out there are 10 places called Lisbon dotted around the US, according to the search.
Me with four open cli terminals righ now:
https://i.kym-cdn.com/photos/images/original/001/617/650/91a.jpg
Got one more for you: https://gossip.ink/
I use it via a docker/podman container I’ve made for it: https://hub.docker.com/repository/docker/vluz/node-umi-gossip-run/general
I got cancelled too and chose Hetzner instead. Will not do business with a company that can’t get their filters working decently.
Lovely! I’ll go read the code as soon as I have some coffee.
That is extremely better. It is a very interesting problem, as you put it.
We know remarkably little about how AI systems work
Every single time I see this argument used, I stop reading.
I do SDXL generation in 4GB at extreme expense of speed, by using a number of memory optimizations.
I’ve done this kind of stuff since SD 1.4, for the fun of it. I like to see how low I can push vram use.
SDXL takes around 3 to 4 minutes per generation including refiner but it works within constraints.
Graphics cards used are hilariously bad for the task, a 1050ti with 4GB and a 1060 with 3GB vram.
Have an implementation running on the 3GB card, inside a podman container, with no ram offloading, 1 vcpu and 4GB ram.
Graphical UI (streamlit) run on a laptop outside of server to save resources.
Working on a example implementation of SDXL as we speak and also working on SDXL generation on mobile.
That is the reason I’ve looked into this news, SSD-1B might be a good candidate for my dumb experiments.
Goddammit! Don’t tell that one, I use it to impress random people at parties.
Absolutely stellar write up. Thank you!
I have a couple of questions.
Imagine I have a powerful consumer gpu card to trow at this solution, 4090ti for the sake of example.
- How many containers can share one physical card, taking into account total vram memory will not be exceeded?
- How does one virtual gpu look like in the container? Can I run standard stuff like PyTorch, Tensorflow, and CUDA stuff in general?
Oof, pop-culture references are hard and I had not considered that at all.
Thanks for the examples, I’ll have a think on how to deal with those.
My only insight is one you already had.
Test at least the comment before, and then use the output to dampen or amplify the final result.
Sorry for being no help at all.
--
My project is very basic but I’ll post it here for any insight you might get out of it.
I teach Python in a variety of settings and this is part of a class.
The data used is from Kaggle: https://www.kaggle.com/competitions/jigsaw-toxic-comment-classification-challenge/
The original data came from Wikipedia toxic comments dataset.
There is code too from several users, very helpful for some insight into the problem.
Data is dirty and needs clean up so I’ve done so and posted result on HF here:
https://huggingface.co/datasets/vluz/Tox
Model is a very basic TensorFlow implementation intended for teaching TF basics.
https://github.com/vluz/ToxTest
Some of the helper scripts are very wonky, need fixing before I present this in class.
Here are my weights after 30 epochs:
https://huggingface.co/vluz/toxmodel30
And here is it running on a HF space:
https://huggingface.co/spaces/vluz/Tox
deleted by creator
While designing a similar classifier, I’ve considered the idea of giving it the whole thread as “context” of sorts.
Not just the parent comment, the whole thread up to original post.
I’ve abandoned the idea.
A comment must stand on it’s own, and it would put limits on results, the way I was planning to do it.
I might be very wrong, your insight into this would be very helpful.
My original idea was to go recursively trough the thread and test each comment individually.
Then I would influence the actual comment results with the combined results of it’s parents.
No context during inference, just one comment at a time.
For example consider thread OP->C1->C2->C3.
My current model takes milliseconds per test with little resources used.
It would be ok up to very large threads but would contain a limit to save on answer time.
I want to determine if Comment 3 is toxic in the context of C2, C1, and OP.
Test C3, test C2, test C1, test OP. Save results.
My current model gives answer in several fields (“toxic”, “severe toxic”, “obscene”, “threat”, “insult”, and “identity hate”)
The idea was to then combine the results of each into a final result for C3.
How to combine? Haven’t figure it out but it would be results manipulation instead of inference/context, etc.
Edit: Is there any way you can point me at examples difficult to classify? It would be a nice real world test to my stuff.
Current iteration of model is very new and has not been tested in the wild.
Just pip install mscandy -U
DS1 to DS3, I lost count of the hours.
Messing around with system python/pip and newly installed versions till all was broken and then looking at documentation.
This was way back on the 00’s and I’m still ashamed on how fast completely I messed it up.