@blakehunsicker yes, it can be fine-tuned at a rate of ~5000 tokens/second, which should be sufficient for small-to-medium-size datasets. Fine-tuning instructions are here: https://github.com/kingoflolz/me...
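For anyone who would rather not use the TPU recipe in the linked repo, here is a minimal fine-tuning sketch using the Hugging Face transformers port of GPT-J instead. This is an illustrative alternative, not the repo's method; the `train.txt` dataset path and the training hyperparameters are placeholders.

```python
# Minimal causal-LM fine-tuning sketch for GPT-J via Hugging Face transformers.
# NOTE: this is not the mesh-transformer-jax TPU recipe linked above, and the
# dataset file "train.txt" is a placeholder (one document per line).
from transformers import (AutoTokenizer, GPTJForCausalLM, Trainer,
                          TrainingArguments, DataCollatorForLanguageModeling)
from datasets import load_dataset

model_name = "EleutherAI/gpt-j-6B"            # checkpoint published on the Hugging Face Hub
tokenizer = AutoTokenizer.from_pretrained(model_name)
tokenizer.pad_token = tokenizer.eos_token     # GPT-J has no pad token by default
model = GPTJForCausalLM.from_pretrained(model_name)

# Load and tokenize a plain-text corpus (placeholder path).
raw = load_dataset("text", data_files={"train": "train.txt"})

def tokenize(batch):
    return tokenizer(batch["text"], truncation=True, max_length=512)

tokenized = raw.map(tokenize, batched=True, remove_columns=["text"])

trainer = Trainer(
    model=model,
    args=TrainingArguments(
        output_dir="gptj-finetuned",
        per_device_train_batch_size=1,
        gradient_accumulation_steps=8,   # a 6B-parameter model needs a large-memory GPU/TPU
        num_train_epochs=1,
        fp16=True,
    ),
    train_dataset=tokenized["train"],
    data_collator=DataCollatorForLanguageModeling(tokenizer, mlm=False),
)
trainer.train()
```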
Yep! Doors have been OPENED 🤯 An open-source cousin of GPT-3 is here 😇
- Performs on par with 6.7B GPT-3
- Performs better and decodes faster than GPT-Neo
- repo + colab + free web demo
Got to know about it through a Towards Data Science article: https://towardsdatascience.com/c...
More details in @arankomatsuzaki's article: https://arankomatsuzaki.wordpres...
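If you want to try it outside the web demo or colab, here is a quick generation sketch assuming the Hugging Face transformers port and the EleutherAI/gpt-j-6B checkpoint; in float16 it needs roughly 16 GB of GPU memory.

```python
# Quick local-generation sketch with the Hugging Face port of GPT-J-6B.
import torch
from transformers import AutoTokenizer, GPTJForCausalLM

tokenizer = AutoTokenizer.from_pretrained("EleutherAI/gpt-j-6B")
model = GPTJForCausalLM.from_pretrained(
    "EleutherAI/gpt-j-6B", torch_dtype=torch.float16
).to("cuda")

prompt = "GPT-J is an open-source language model that"
inputs = tokenizer(prompt, return_tensors="pt").to("cuda")

# Sample a continuation; adjust temperature/top_p to taste.
output_ids = model.generate(
    **inputs, do_sample=True, temperature=0.9, top_p=0.95, max_new_tokens=60
)
print(tokenizer.decode(output_ids[0], skip_special_tokens=True))
```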
@pallpakk some results were definitely weird, but overall it works great! Negative sentiment, foul language, etc. are context-specific outputs, so if an input is itself negative/abusive, the output is bound to reinforce the same sentiment.
*GPT-J is just as good as GPT-3.* It is more efficient, but with more quirks. In our JPRED scores, it did better with simple TCS tasks, but lost with the more complex tasks.
By removing the Jordan Algorithm: Our next proposed change to a probability model is removing the Jordan Algorithm. The Jordan Algorithm is a special procedure used for simple TCS tasks that allows for fast analysis of different sequence pairs, as well as being able to easily analyze simple n-gram (aka word) models. It is more efficient, but with more quirks. In our JPRED scores, it did better with simple TCS tasks, but lost with the more complex tasks.
...