StarSearch is our AI-powered feature that provides in-depth insights into contributor history and activities, bringing transparency and a new depth of knowledge about open source projects.
We're beyond excited to introduce StarSearch to you all. Our newest feature at OpenSauced!
In 2022, I worked at GitHub and consistently got feedback from open-source maintainers that theyโd love more insights. This is also true for builders who depend on open source to ship and iterate quickly. Founders are choosing open source, but few know who to connect with for expertise.
๐ Why StarSearch?
In the 90s, I watched a show on television at my grandma's house called StarSearch. This same show discovered Beyonce, Ryan Gosling, and Justin Timberlake before they were stars. This new and complementary feature is live in our OpenSauced product today, and we have indexed millions of GitHub events and are excited to get your feedback on the approach.
๐ Track impactful developers from open source projects
๐ Discover new projects in open source
๐ Identify your next core contributors
Using StarSearch, you can find up-and-coming contributors through trends in public open-source data, which can set you apart from the rest when building. We love open source and want more folks connected to the who's who in this space. StarSearch is our first exploration of adding semantic search to fetch relevant context on the projects used, built, and depended on.
๐ Try it out and run the following test prompts https://oss.fyi/starsearch
Who are some Tailwind experts who also have an interest in Rust?
Who are the prevalent contributors in the Typescript ecosystem?
What pull requests has charliermarsh worked on recently?
๐ Thank you for exploring StarSearch! We've invested our passion and creativity into this; your feedback is invaluable. Dive in, engage, and know we're here for any questions or chats about the wonders of OpenSauced.
๐ We're all ears for your feedback below! ๐
@ena_gluhakovic, that is an amazing question. Right out of the gate, the band name is "Looks good to me (LGTM)," and the debut album would of need to be "Hello World"
@bdougieyo Thatโs fantastic! "LGTM" is a brilliant band name, and "Hello World" is the perfect debut album title for a group of open-source rockstars. Can't wait to see their "Release Notes" tour dates! Thanks for the creative and fun response! ๐ธ๐
@bdougieyo@bekahhw Yeaah, definitely has a nice ring to it! First thing that came to my mind "Syntax Error & The Debuggers". I think this is such a cool game, thank you guys for playing ๐ธ๐ผ
Congrats on launching StarSearch. Itโs the Github Contributor Tab on steroids.
This opens up so many opportunities for finding the right person for the right task.
Even just searching for specific packages or components, I can easily identify the persons who are instrumental in the development.
@mfts0 Thanks for checking it out, Marc. We have so many more things planned, but we had to limit ourselves so we could ship. I would be happy to hear any specific insights you like to see from the prompts or if any edge cases arise.
his StarSearch feature sounds interesting, it can help me better understand the contributors in open-source projects. I'm looking forward to trying it out and discovering new contributors.
i had a lot of fun working on the user interface (UI) for StarSearch. One exciting challenge that I tackled was loading dynamic widgets into a StarSearch response depending on the StarSearch response returned. I think I came up with an elegant solution, but Iโm always welcome to feedback. ๐
Itโs still early days, but weโll be adding more dynamic widgets over time now that the logic is in place in the code base.
Stay saucy peeps!
@bdougieyo thanks! I went for a long walk last night, like I usually do, and when I got back, I was like, letโs see if I can do this thing we chatted about a couple of weeks ago. ๐๐
@nickytonline couldn't have made this possible without your incredible work! And appreciate everything that we are still shipping, right this very minute! ๐ซก
It's been really cool to watch the process and growth of this feature. We've been able to identify information that wasn't readily available before. It was there, but you would have to dig deep and spend a lot of time to find the answers for it. I'm a big fan of community and building bridges and I think StarSearch offers a new way to connect people.
It also offers a great way to support people and projects as well. You can ask questions like, "Is the Laravel project at risk?" It'll let you know what its lottery factor (risk if a top contributor leaves) is and who the top contributors are. If a project is at high risk because a top contributor is putting in most of the contributors, that's an opportunity to support their continued work.
I'm looking forward to seeing the continued development of StarSearch!
@bekahhw lottery factor is something I'm really passionate about: I've experienced it first hand and have had to pick up the pieces after critical maintainers have left projects. And I couldn't agree more, so glad we were able to bring this to the platform and give these capabilities to StarSearch!
Amazing Work @bdougieyo@nickytonline@bekahhw and team! I showed this to a few product managers at my company and they loved it. Opens new possibilities for non-tech folks to estimate productivity and maybe even hire through open source instead of the standard coding interviews!
@bdougieyo@bekahhw@diivii thanks for the kind words and I ๐ฏ agree about giving non-technical and technical people superpowers for their tool belt. Not all heroes were capes! ๐คฃ
A few months ago, @bdougieyo came to me with what sounded like a big, crazy, amazing idea ๐ค
We have ALL this public unstructured GitHub data. We've ramped up our data-pipelines to operate at GitHub's incredibly huge scale. We've even invented nuanced and unique metrics that can give very good insights into open-source communities.
But what if we enhanced those mechanisms to work with AI capabilities?
โจ StarSearch is the result of months of work experimenting, living on the bleeding edge of AI technologies, and bringing a huge scale AI system to production. Ask StarSearch about projects you care about, things you're trying to work on, the Lottery Factor, and see what sorts of unique discoveries you can make!
๐ง I built the backend and data pipelines that make all this possible: StarSearch is a complex, multi-agent RAG (retrieval-augmented-generation) that has access to droves of our pristine GitHub data and other services. Several micro-services working together make this all possible: the "pizza-time" service consumes realtime GitHub events from the public GitHub events API. This populates the metrics, histograms, and time-series charts you see today on OpenSauced.
Next, we have something called the "star-search-embedder" that looks for rows of data in our database that match the criteria we've set. It takes batches of those rows and embeds them into our vector store using a text embedding model. But first, we generate an AI summary on this unstructured texts in order to scrub them of any "messy" metadata or stuff we don't really need to embed. This is accomplished at-cost using a few GPUs on Kubernetes, vLLM, and open-source AI models.
Then, within our API, we use a multi-agent architecture for our LLMs to enable a powerful question and answer engine. The agents have access to several of our other services and can "call" each-other to get necessary information from one agent to the other. For example, our internal "Bing Search AI Agent" knows how to interface with the Azure Bing search service, can generate summaries on mass amounts of raw, unstructured search results, and can hand those back to a calling agent that need made the query. More importantly, these agents can perform a vector-search on all those vectors we've embedded through our data pipelines. Today, we have over 100m+ vectors that power this engine and enable StarSearch to search through pull requests and issues.
๐ One of my favorite queries to use is a sort of "needle in the hay-stack" search. Let's say I'm working at a product team shipping a Kubernetes platform. And I need to upstream some changes to how persistent storage volumes are claimed in the upstream kubernetes/kubernetes project: that's a very, VERY heavy lift if you've never worked in that community before! Where to even start!? What groups are involved? Who would be best to reach out to for feedback in the community? Where is the prior art? Asking StarSearch questions like "Who has worked on persistent volume claims in kubernetes/kubernetes?" can start to give you an idea of who those people might be, unlocking you to gain community support on a huge undertaking!!
๐ Would love to hear any and all feedback!! Thanks all and stay saucey!
Congrats on the launch @bdougieyo ! I just queried for some of the top contributors for one repo I was interested in and got good insights. I like how it not just shows the list, but a glimpse of their contributions, and an insight at the end
@sudhamabhatia, I appreciate you testing it out. We have a PR open to add some extra functionality soon. It should get merged later today, let us know if you have any other feedback. https://github.com/open-sauced/a...
I normally spend hours in my week hunting for guests on CodingCat.dev podcast. Now I can find people so much easier.
For example I gave this prompt today:
"Who can I interview on my podcast CodingCat.dev about typescript projects that are trending"
And got back:
Here are several contributors to trending TypeScript projects that would make excellent guests on your podcast:
@anonrig:
Project: getsentry/sentry
Contribution: Added more TypeScript rules to improve type checking and code quality.
Summary: Anonrig submitted a pull request to enforce stricter TypeScript rules in the Sentry codebase, adding the @types/jest and ts-jest packages, and creating a new test suite.
@Lightning00Blade:
Project: puppeteer/puppeteer
Contribution: Improved test reliability and added TypeScript types.
Summary: Lightning00Blade worked on reducing test flakiness and adding TypeScript types, improving the development experience for those using TypeScript with Puppeteer.
@ScreamZ:
Project: amannn/next-intl
Contribution: Updated documentation for TypeScript usage.
Summary: ScreamZ enhanced the next-intl project documentation, offering better guidance on how to use TypeScript within the project.
@jakebailey:
Projects:
microsoft/TypeScript-Website
microsoft/TypeScript
Contributions:
Forced TypeScript to emit .d.ts files and linked changes with Microsoft's monaco-editor.
Updated dependencies, switching back to TypeScript's bundler from esbuild.
Summary: Jake Bailey has been active in both updating dependencies and linking TypeScript changes, focusing on improving the TypeScript development environment and its tooling.
You can reach out to these contributors to share their insights and experiences on working with TypeScript on trending repositories.
Starsearch is a pretty cool feature! I can definitely see myself using it in the future when I want to maintain my own projects. Congratulations on the launch @bdougieyo
@bonvisions we've just launched the product, but there's wide application for anyone in open source, IMO. As it stands today, the open source ecosystem can feel pretty disconnected. Theoretically, we know that there are maintainers, contributors, and projects out there that are incredibly talented and innovative, but finding them can feel impossible. StarSearch makes this easier. And with 90% of companies using open source software, it's definitely useful to understand who is keeping open source projects alive or has the experience to be able to answer questions. This is just the beginning.
@bonvisions, thats a good question. I spent the last two years working alongside engineers and customers, building our first product, OpenSauced. Our feedback from there is that our customers ask the same questions about open source monthly. I can show you how this hit 100m ARR and clear market size in a pitch deck, but the TBH hat is the secret sauce.
Replies
OpenSauced
OpenSauced
OpenSauced
OpenSauced
Papermark
OpenSauced
OpenSauced
Brainstory
OpenSauced
OpenSauced
OpenSauced
OpenSauced
Tidyread
OpenSauced
OpenSauced
OpenSauced
OpenSauced
OpenSauced
OpenSauced
OpenSauced
OpenSauced
Hashnode
OpenSauced
OpenSauced
OpenSauced
OpenSauced
OpenSauced
OpenSauced
OpenSauced
OpenSauced
OpenSauced
Track
OpenSauced
OpenSauced
OpenSauced
OpenSauced
Appwrite
OpenSauced
OpenSauced
OpenSauced
OpenSauced
OpenSauced
OpenSauced
OpenSauced
OpenSauced
OpenSauced
OpenSauced
OpenSauced
OpenSauced
OpenSauced
OpenSauced
OpenSauced
OpenSauced
OpenSauced
OpenSauced
Daytona
OpenSauced