• Subscribe
  • Do you feel that GPT-4 is getting worse?

    Mehdi Rifai
    29 replies
    Recently I found myself being frustrated at GPT-4 and its lack of understanding to simple task like summarize this article from a link. I often get a "I can't accomplish this task". Do you witness similar situations? How do you cope with it?

    Replies

    Jacky Wong
    Yeah - it's gotten a lot worst unfortunately.
    Share
    Sathish Nagarajan (SNR)
    😂 exactly it has become like humans.. if I give it a task, it gives me back the same task in a different way
    Share
    Frank Sondors
    Try Gemini
    Share
    Thomas Hallaran
    Objectively it is getting worse!
    Share
    Jamie L
    AI Desk by Collov AI
    AI Desk by Collov AI
    I've noticed GPT-4 can sometimes stumble on tasks like summarizing from a link, possibly due to the way it processes external content. When I encounter this, I usually extract the key points manually and then ask GPT-4 to summarize based on that information to ensure it stays within its operational parameters.
    Share
    Jamie L
    AI Desk by Collov AI
    AI Desk by Collov AI
    I've noticed GPT-4 can stumble at times, Mehdi, especially with tasks that require real-time web interaction which it's not designed for. When it hits a snag, I pivot to using it for brainstorming or drafting outlines, leveraging its strengths in creativity and content generation.
    Share
    Pallavi Ganpat Babar
    I've never used ChatGPT-4, but I believe that its performance will vary depending on individual experiences and expectations. It's also important to consider that newer versions of AI models may still be undergoing refinement and improvement over time.
    Share
    Vincent Xu
    AI Researcher
    AI Researcher
    I've noticed GPT-4 can sometimes stumble on tasks that seem straightforward, likely due to the nuances of language processing or current limitations. When I encounter this, I try to rephrase my request or break down the task into simpler components, which often helps clarify the intent.
    Share
    Mehdi Rifai
    @cen_xu do you use specific prompt structures?
    Swayam
    I just ask a staff member to get the task done for me because i might just smash the screen if i spent more time on it
    Share
    Aris Nakos
    Has anyone actually measured performance degradation "objectively" here ? Think response quality vs inference time -- let response quality be something simple that you defined, such as JSON completeness.
    Share
    Dzmitry Tsemirau
    The same with me. Sometimes I think it's making fun of me.
    Share
    Darrell M. Dengler
    Yes it going very worst
    Share
    Peter Horvath
    I think it depends a lot on your prompts. Also, due to the updates, slight differences can happen in the outputs for the same prompt. Even minor things / details can influence your output significantly. I’ve generated approx 16M words with GPTs in the past 3+ years (including the previous versions), so I got this from first hand experience.
    Share
    Atticus Li
    I have been using GPT since when it first commercialized it. It gets worse, then it gets better. It comes in cycles. This is why my team are building our own ML models and fine-tuning it to avoid depending on OpenAI
    Share
    Mehdi Rifai
    @atticusli do you plan on commercializing your model?
    Atticus Li
    @mehdi_rifai Yes, we will be launching our product in about a month! Here is what we have built: https://try.jobsolv.com/waitlist/
    Share
    Nadia Zueva
    Launching soon!
    In terms of, for example, coding tasks, absolutely not; it's fortunately getting better and better. However, I've noticed that developers have limited ability to check information from links. I believe this was implemented for security purposes. Did you tried to upload an article as attached file?
    Share
    Mehdi Rifai
    @nadiaaesty that's what i'm starting to do. Pasting the entire article in he prompt instead of asking to check the link
    Adams Aimé-Désiré
    I eared some people talk about it but i did not feel it. I think in they last update, OpenAi said something about that, and they are working on it
    Share
    Shambhavi Mahajan
    i really like claude more
    Share
    Francesco D'Alessio
    Tool Finder - Find Productivity Tools
    Tool Finder - Find Productivity Tools
    Yes, then this last few days it has kicked in.
    Share
    Sergei Petrov
    I haven't noticed any changes yet. Perhaps it's a matter of prompts?
    Share
    Prem Saini
    Done using GPT-4, Try gemini been using it a lot lately!
    Share