That is available in regular ComfyUI, though theirs does look very easy to use.
The neatest prompting tool idea I saw was one that used an LLM to do prompt refinement for the image, though the implementation I found was more antagonistic than helpful. It garbled my prompts into near-synonyms, giving me a basically equally crappy prompt but for a very different image.
The varied prompts are very neat too, where you give it a template string and a list of options to interpolate and it generates an image for each combo or random combos or etc. Very nice when you want to see the same image with different moods, lighting, backgrounds, etc.
This is backed by both github, and YC? That's pretty impressive. Though sadly the demo on the front-page gives no results for me, just endless animations.
What's the difference between your offering and the plethora of others? [1] Do you offer all checkpoints and LoRas even those who are not free to use in generation services or the IP-adapters with insightface as requirements? Rather fishy tbh.
Also offering workflows from your team that infringe on copyrights from the mouse is a bold move.
We don't offer any non-free-to-use checkpoints/LORAs on the platform out-of-the-box. If you wanted to import a model/lora to your own account from a CivitAI link you can do so, but we're at this point letting users do this under good faith that they have permission to use these models.
We also have face IP-adapter nodes in our app!
I would say there are a lot of more minor UX differences about our app than other offerings. One particular favorite of mine is the ability to switch between the Node Editor screen and the Playground screen using cmd+p.
We built this feature mainly bc the node editor isn't great for generating images, but it's awesome for dialing in the exact aesthetic you want with different nodes/settings. We built the Playground screen for generating images once you have a workflow you like.
Probably worth reworking this tagline, especially given your goal is to reduce the complexity of ComfyUI. Instead this makes it sound like it’ll have lots of settings or something.
gotcha thanks. any suggestions there? our goal is to have more levers than something like Midjourney, but without the complexity of comfy. Hard to convey this in a tagline lol
Take a look at OpenArt and how they're positioning themselves. They ditched their free and hobbyist users and now they're doing workflows for business and marketing use cases. Word on the street is that they're making an absolute killing. More revenue than giants like Leonardo, which is a full on drawing and animation suite.
You'd be surprised what making workflow easy can unlock in revenue potential. Look at how they're selling it and lean into that. I'd be doing it if our product was shaped like that.
Looking at your YC history, Rubbrband was initially meant to be "a way to train open-source Machine Learning models in just one line of code." I'm curious what this initial offering was and why you pivoted.
yeah we went through a few pivots actually - Rubbrband was initially a mobile app for musicians. It would listen to your practice sessions and give feedback. We pivoted out of this primarily because of issues with the technology and market.
>some major problems we’ve had with it. First is that it runs primarily on your own hardware
That's not a "major problem", that is the one most important core feature that anyone who is looking for full creative freedom and who doesn't want to enter "safety" censorship and surveillance hell should be looking for.
You can run ComfyUI with as little as 4 GB VRAM, which is not that much anymore.
Are you saying that the torch performance with MPS support enabled didn't meet your performance expectations? Or are you using an intel macbook and/or one with a tiny amount of ram/vram?
I run it on a 6 year old i5 box, no gpu, nothing. Granted, I'm fine with waiting 30 minutes for a generation, but indeed to be that's a core feature of the tool.
What's the power consumption on that ?seems like its more expensive to run than a gpu, but if you're not doing it often i guess it's not worth upgrading.
I have a similar setup: I'm running a Discord bot for family and friends that can do image generation. It's a 6-core AMD EPYC VPS with no GPU.
By using a SD 1.5 Latent Consistency Model (LCM) it generates an image in 3 steps or fewer. It takes about 30 seconds to generate a 512x512 image on that machine.
Man, commenters here are... grumpy. Yes it’s true you can do diffusion on a pi, if you’re okay waiting 4 years for your 50 steps to complete lol.
I want something like this to work, and am sure someone will eventually make the “photoshop of stable diffusion.” Personally my generations use custom fine-tunes and syncing 7gb checkpoints to a remote server can be a pain.
The bigger issue for Rubberband, I think, is that Civit weights are the killer app, and all Civit needs to do to win is make their generator a little more visible. Same with Civit’s trainer. They have an easy to use pipeline to go from training images -> generations, and they seem to run it close to cost, but it doesn’t even appear in the navigation until you sign in.
That doesn’t mean they’ll win, of course. Plenty of companies screw up, and maybe it’s not worth “winning” to begin with. See also: HuggingFace, which has Spaces, but is mostly used for the free bandwidth.
A hosted version of image generation could be useful to me because my hardware isn't very good. However, one of the main reasons to use local generation in the first place is privacy and anti-censorship. I don't see a TOS on your service. How does it handle those cases?
Similarly here - I clicked over and looked at the site and went first thing to find terms of use.. and nothing.
I have a couple of uses for such a thing, but perhaps you may offer one click deploy totally private use as you wish kind of thing,
I must assume a majority of people using comfyui or anything similar are going to be making stuff that is not allowed to be made on the more popular platforms.. your company may find itself quickly in the fire of many saying you should not be hosting X thing or Y thing or allowing people to do W or V.. and then there may be jurisdictions with whatever..
cloudlfare tried being a dumb pipe and espoused free speech until they did a 180..
So whtether it's investor pressure, or gov pressure or whatever it will be a thing at some point.
So also, what about export and backup options?
I would not create on another platform that can willy nilly pull the plug and leave you without your data, and the days of you pulling the plug are not far away imho.
Would use though, I hate setting this stuff up for myself, and I have a few other that can benefit from this sort of service - especially if it could be usage based at one price, and cold storage of training data at a lower price.
I don't play with SD anymore mainly because now i find it exhausting to keep up with the tools and tech and low prompt adherence when trying to generate something specific. It's still fun if you are happy with whatever it gives you without putting in much effort though.
"Play around with Rubbrband for free. No credit card required." yes but WHAT is the pricing if I like it? I need it upfront. Is it credits? Subscription? Pay-per-image? How does it work?
schopra909 | 1 year, 5 months ago
[OP] jrmylee | 1 year, 5 months ago
everforward | 1 year, 5 months ago
The neatest prompting tool idea I saw was one that used an LLM to do prompt refinement for the image, though the implementation I found was more antagonistic than helpful. It garbled my prompts into near-synonyms, giving me a basically equally crappy prompt but for a very different image.
The varied prompts are very neat too, where you give it a template string and a list of options to interpolate and it generates an image for each combo or random combos or etc. Very nice when you want to see the same image with different moods, lighting, backgrounds, etc.
stevekemp | 1 year, 5 months ago
[OP] jrmylee | 1 year, 5 months ago
42lux | 1 year, 5 months ago
Also offering workflows from your team that infringe on copyrights from the mouse is a bold move.
[1] https://openart.ai/ https://comfyworkflows.com/ https://comfy.icu https://cogniwerk.ai/ https://www.runcomfy.com/
[OP] jrmylee | 1 year, 5 months ago
We also have face IP-adapter nodes in our app!
I would say there are a lot of more minor UX differences about our app than other offerings. One particular favorite of mine is the ability to switch between the Node Editor screen and the Playground screen using cmd+p.
We built this feature mainly bc the node editor isn't great for generating images, but it's awesome for dialing in the exact aesthetic you want with different nodes/settings. We built the Playground screen for generating images once you have a workflow you like.
yunohn | 1 year, 5 months ago
Probably worth reworking this tagline, especially given your goal is to reduce the complexity of ComfyUI. Instead this makes it sound like it’ll have lots of settings or something.
[OP] jrmylee | 1 year, 5 months ago
block_dagger | 1 year, 5 months ago
echelon | 1 year, 5 months ago
You'd be surprised what making workflow easy can unlock in revenue potential. Look at how they're selling it and lean into that. I'd be doing it if our product was shaped like that.
[OP] jrmylee | 1 year, 5 months ago
kipukun | 1 year, 5 months ago
[OP] jrmylee | 1 year, 5 months ago
pizzafeelsright | 1 year, 5 months ago
Maybe the humans have been trained by AI to give AI a job?
Maybe the pivot was "gotta build something"
Imagine the controls on one line of code and how long is that line?
bozhark | 1 year, 5 months ago
elpocko | 1 year, 5 months ago
That's not a "major problem", that is the one most important core feature that anyone who is looking for full creative freedom and who doesn't want to enter "safety" censorship and surveillance hell should be looking for.
You can run ComfyUI with as little as 4 GB VRAM, which is not that much anymore.
[OP] jrmylee | 1 year, 5 months ago
spmurrayzzz | 1 year, 5 months ago
jokethrowaway | 1 year, 5 months ago
AuryGlenz | 1 year, 5 months ago
isoprophlex | 1 year, 5 months ago
lawlessone | 1 year, 5 months ago
isoprophlex | 1 year, 5 months ago
elpocko | 1 year, 5 months ago
By using a SD 1.5 Latent Consistency Model (LCM) it generates an image in 3 steps or fewer. It takes about 30 seconds to generate a 512x512 image on that machine.
https://civitai.com/models/4384?modelVersionId=252914
ekianjo | 1 year, 5 months ago
that's a feature not a bug
ed | 1 year, 5 months ago
I want something like this to work, and am sure someone will eventually make the “photoshop of stable diffusion.” Personally my generations use custom fine-tunes and syncing 7gb checkpoints to a remote server can be a pain.
The bigger issue for Rubberband, I think, is that Civit weights are the killer app, and all Civit needs to do to win is make their generator a little more visible. Same with Civit’s trainer. They have an easy to use pipeline to go from training images -> generations, and they seem to run it close to cost, but it doesn’t even appear in the navigation until you sign in.
That doesn’t mean they’ll win, of course. Plenty of companies screw up, and maybe it’s not worth “winning” to begin with. See also: HuggingFace, which has Spaces, but is mostly used for the free bandwidth.
Anyway, best of luck!
[OP] jrmylee | 1 year, 5 months ago
CaptainFever | 1 year, 5 months ago
[OP] jrmylee | 1 year, 5 months ago
stevenicr | 1 year, 5 months ago
I have a couple of uses for such a thing, but perhaps you may offer one click deploy totally private use as you wish kind of thing,
I must assume a majority of people using comfyui or anything similar are going to be making stuff that is not allowed to be made on the more popular platforms.. your company may find itself quickly in the fire of many saying you should not be hosting X thing or Y thing or allowing people to do W or V.. and then there may be jurisdictions with whatever..
cloudlfare tried being a dumb pipe and espoused free speech until they did a 180..
So whtether it's investor pressure, or gov pressure or whatever it will be a thing at some point.
So also, what about export and backup options?
I would not create on another platform that can willy nilly pull the plug and leave you without your data, and the days of you pulling the plug are not far away imho.
Would use though, I hate setting this stuff up for myself, and I have a few other that can benefit from this sort of service - especially if it could be usage based at one price, and cold storage of training data at a lower price.
random thoughts.
[OP] jrmylee | 1 year, 5 months ago
lovegrenoble | 1 year, 5 months ago
[OP] jrmylee | 1 year, 5 months ago
smusamashah | 1 year, 5 months ago
I don't play with SD anymore mainly because now i find it exhausting to keep up with the tools and tech and low prompt adherence when trying to generate something specific. It's still fun if you are happy with whatever it gives you without putting in much effort though.
mdrzn | 1 year, 5 months ago