Crawlify
AI-powered data extraction APIs. Hassle-free data retrieval.
Sandra Ionescu
Put your web data extraction on autopilot with AI technology that retrieves clean, structured data accessible via API. No coding, selectors or any manual input needed.
Replies
Brad Ungar
Would there ever be a way to migrate robots from other services? We use Dexi, but our biggest long-term fear is not being able to move our robots if anything happens to them.
Sandra Ionescu
@bradungar Hey Brad, we are working towards eliminating manually configured robots and crawlers. The algorithms we develop browse pages as humans do and try to identify data points of interest. At the moment we provide APIs to extract data from e-commerce pages, blogs, job pages and real estate listings.
Pratik Khandagale
It's not really working for the page I wanted to check! It seems to be stuck.
Sandra Ionescu
@allpratik Can you please share that specific page with us? It would help us tremendously in improving the accuracy - thanks!
Emmanuel Kaska
Curious if it's possible to use this to extract data from social media sites like LinkedIn and Instagram. Interested in analyzing hashtags and groups.
Sandra Ionescu
@emmanuel_kaska We already have a few clients extracting data from the platforms you mentioned. Reach out on our website and we will talk specifics.
Eric Axelrod
Very cool, @sandra_ionescu. I definitely want to explore this.
Sandra Ionescu
@eric_axelrod Thanks, we really appreciate it. Let us know if you have any questions; a lot of our APIs are currently in closed beta.
Sandra Ionescu
Hello hunters! At scale, websites change and crawlers break daily. Maintaining them is time-consuming and painful. Crawlify is on a mission to help companies of all sizes automate their data extraction at scale. To achieve this, the platform combines machine learning with natural language processing to retrieve clean, structured data without the need for manual rules or site-specific training. You can try it for yourself on our website. The current APIs are:
1. The Product API [released] - automatically extracts product data from any e-commerce page.
2. The Real Estate API [closed beta] - automatically extracts data from real estate listings, along with other data points such as crime and safety statistics, foreclosure/auction listings, or urban planning and construction permits.
3. The Job Page API [closed beta] - extracts clean job listing data from any board.
4. The Article API [released] - extracts article data from pages without an RSS feed.
Jeremy Crisp
@sandra_ionescu please confirm if this is still a live project or not, thanks!
Barnea Florin
Played around with the demo on the website and it looks really good. Can't wait for the real estate API.
Sandra Ionescu
@barnea_florin Thanks for the feedback. The real estate API is currently in beta; we plan to launch it to the public next month. Please subscribe on our website to get notified when we do.
Conor Clarke
Looks really good, will try out the free version first! Left a few thoughts here too as I was looking through your landing page, in case it's helpful - https://app.usebubbles.com/45da6...
Sandra Ionescu
@conor_clarke1 I can't access the link, can you please share it again? We are aware that our landing page needs some improvements :D so we are curious about and thankful for your thoughts. Looking forward to reading them.
Enrico Faccioli
Great stuff, looking forward to testing the real estate API! Let me know if you need one more beta tester, happy to help :)
Jatin Chaudhari
Congratulations on the launch!
Lena Arnaud
I believe Diffbot also offers similar APIs. Curious to know the differences. Thanks.
Wayne Smallman
Hi @lenaarnaud, Diffbot appear to have more products, and Extraction is of particular interest to me.
Francisca Aguayo
Tested it as well on a range of different websites. The accuracy seems top notch. On a single page it did not report the correct availability.
Sandra Ionescu
@francisca_aguayo Hey Francisca, thanks for sharing your feedback with us. Can you point us to that specific page? It will tremendously help us in improving our accuracy.
Daniela Quiroz
Tried it out on several e-commerce websites from different countries. The results are accurate so far. It would be really handy for affiliate websites if you could provide an embed widget for any product to display details like price and title.
Sanda Gorcea
@daniela_quiroz I second that. I have a small review website in my native language and no affiliate plugins are available for my country.
Sandra Ionescu
@daniela_quiroz Thank you for your feedback! We plan to develop integrations with Shopify and WooCommerce in the coming weeks.
Olivia Belgea
Is there a demo available for the Jobs API?
Sandra Ionescu
@olivia_belgea We can set that up for you. Please schedule a meeting with us on our website. Thanks!
Virgil Deckow
@sandra_ionescu created a trial account! The data extracted so far is accurate!
Virgil Deckow
@sandra_ionescu Looking through the results: are you going to support extracting product attributes in the Product API in the future?
Sandra Ionescu
@virgil_deckow We are already working on that. In the next iteration of the product, clients will be able to extract more data points, including product attributes.
Damien Tress
Did a few smoke tests using the Product API. Satisfied with the results so far. What load and concurrency would the endpoint reasonably handle? We have a use case in which we need to get pricing data across multiple e-commerce websites in a very short time frame.
Sandra Ionescu
@damien_tress We use Google Cloud Run to handle requests using a serverless architecture. We scale automatically up and down depending on load. Therefore, we can handle any enterprise use case. We can set up a custom trial account so you can test our API at the scale you need.
Matheo Noel
Found this on Reddit. I am working with a nonprofit to monitor bulk sales of used books on several websites. I like the results I see so far. I've tried several other providers but they were a bit cost-prohibitive. Do you think you can offer some free credits to power this use case?
Hans O'Conner
What data are you extracting for the Jobs API? Can the product extract data from an entire site?
Sandra Ionescu
@hans_o_conner At the moment we extract: Title, Description, Salary, Location, and recruiter contact info if available. Extracting data from an entire website is on the roadmap for the next iteration.
Verla Frami
I'm currently working a lot with product data to track some information of interest. Have you thought about adding an integration with Google Sheets? It would update data directly in the spreadsheet.
Sandra Ionescu
@verla_frami It's on our roadmap for the next iteration. As an ETA, we will launch the second version in one month.
Wes Drumheller
Your website and/or service appears to be down. I clicked on your website button but no go...
Jeremy Crisp
Is this project (Crawlify.II) still live or has it been discontinued? Please can someone confirm as I had initial conversations not long after this PH launch but now I want to use them, no one's answering anymore. Thanks 🙏🏼