comments (10)

  • What is your content/data license? I don't see anything about this on the site. For something to feel like a community wiki, the community needs to co-own the content and be able to fork. If you think the content is in the public domain because of AI, applying a license like CC BY or CC BY-SA won't hurt, but the content is copyrighted, not applying a license will. (This isn't legal advice.) See "WP:CRANDO" (https://en.wikipedia.org/wiki/Wikipedia:Copyrights#Contribut...) for how Wikipedia does it.

    networked

  • 0 for 10 on some startups (large and small, YC and not) that came to mind.

    It's easy to scrape YC startups from https://www.ycombinator.com/companies. Scrape that and a dozen other investors' portfolio pages and you'll have a useful fraction of startups.

    tlb

  • Hello ! Great project. Do you plan to make it open source as it is already free to use ? If so already, I didn't find the github repository.

    Calgaryp

  • I expected the VERIFIED badges to link to some sort of provenance information. That seems like a must, otherwise (given the "assume everything's incorrect" disclaimers) I'm not sure why one would take that badge seriously.

    CharlesW

  • It's a good idea. Why not ask startups to upload a startup.txt (as opposed to robots.txt) to their web root and collect from that? Pre-filled text forms can be downloaded. Also, as with CB, collect data on individuals through a similar opt-in. Enable users to ping your site when it's ready to collect.

    You could have a "traction" stat and ask for a JS snippet be installed on homepages or a set of pages. Old school and unreliable. Registered users is also a good way to assess traction. Not sure how that information could be readily obtained.

    In my previous comment I mentioned attaching a crypto address to domains - you could do that too. That'd be interesting. One feature you could add long-term is crowdfunding. Either for new features, code releases, media, documents - whatever.

    Crowdfunding activity on startups and individuals would be a great way to measure traction.

    adrianwaj

  • It sounds like none of the data will be reliable? Ai and community seems like very little will be true and I will have no idea which part will be true.

    dgrin91

  • Build trust, collect data from cdrowdsource, if you want to succeed on this.

    Build trust by: truly making this a public good, by open sourcing it. Be the maitnainer. Data dump every week as a zipball/tarball. These will ensure you can't rugpull.

    With this trust, offer an extension (open source of course) to all, which whever a user goes through crunchbase, traxn, etc, sends any factual data (hence non-copyrightable) to you. If you gained trust, I would also do this.

    You get the right to be a maintainer, and figure out if you also want to make a business with it on top.

    clapthewind

  • Really cool concept but so much of the data is wrong. Anthropic ARR is an order of magnitude higher, Replicate did a Series B as well which is not mentioned. There is probably a lot more.

    zopper

  • Would you consider allowing people to login with OpenRouter?

    https://openrouter.ai/docs/guides/overview/auth/oauth

    Would be a good way to have others absorb some of your inference limits and fill in missing data that they need. A call to action on a blank search would be a great flow.

    wett

  • How about expose an API so that users can put the name of a startup and it goes through your AI agent pipeline to acquire an estimate? That way, you don’t need to know every startup under the sun and focus on optimizing your pipeline instead.

    chaidhat