Topic Forums

p/general

Product Forums

p/carrot-3

GPT3 for computer vision

Visit Product

Carrot — GPT3 for computer vision

Kyle Morris

Featured

•

3yr ago

Carrot is a model built by Plaintain Labs and hosted on Banana.dev
The inspiration came from seeing general purpose text models like GPT3 and asking "can we do this for computer vision too?"

Replies

Best

Rosie Higgins

Grapevine

hi I tried the demo, is there a way to upload another image rather than the default to test the model out?

3yr ago

Sahil Chaudhary

Img2Prompt

Maker

Hey! You can change the imageURL in the modelParameters to any image url you want to use, or better would be signing up for the api key to get unlimited use

3yr ago

Rosie Higgins

Grapevine

@csahil28 nice thanks! I tried with a few different images and it worked pretty well :D But for some reason it gives an error with this image:

3yr ago

Sahil Chaudhary

Img2Prompt

Maker

@rosie_higgins_grapevine You can send me the image url at sahil@banana.dev and I can check what's causing the error

3yr ago

Sahil Chaudhary

Img2Prompt

Maker

@rosie_higgins_grapevine I see, the issue is we get a 403 error when we try to access the image for inference

3yr ago

Derek Pankaew

Listening.io

This is AWESOME. Is there any chance I can input an image of a bank statement, and have it extract the text from the bank statement?

3yr ago

Erik Dunteman

Shoot YOUR Shot - AI Edition

Maker

@derekpankaew the model has some OCR capabilities which has impressed us (especially the math example), but the OCR isn't consistent enough where I'd lean on it for financial matters

3yr ago

Kyle Morris

Shoot YOUR Shot - AI Edition

Maker

Hi all 👋! Makers Kyle / Sahil / Erik here! We are ML developers from Banana.dev that spun out into an ML research lab: PlantainLabs We were inspired by general purpose text-based models like GPT3 that launched over a year ago and asked ourselves: Can we do something like this for computer vision? So we launched Carrot! With carrot you can generate state-of-the-art captions of images, ask questions to images, calculate image-text similarity, and more. Try it and let us know what you think!

3yr ago

Vitalie.

I am always curios about products that uses GPT3 and AI. The product you guys build is great and hope to grow as you go. Congrats on launch 🚀

3yr ago

Sahil Chaudhary

Img2Prompt

Maker

@vital1188 Thanks!

3yr ago

Lena Krzemieniewska

Cool thing! I admire every product made with GPT-3, it's such a magic and complex thing! Guys, ale you open for integrations? In digitalfirst.ai were looking for a AI text generator to integrate right now, we'd be also interested in train our own models in the future. If you're open for a quick talk let me know! (lena@digitalfirst.ai)

3yr ago