Understanding Indie Search Engines

Indie Search Engines

What is Teclis?

ArchivedRead

Marginala

Kagi is a privacy-focused, user-centric search engine. Great search experience starts with Kagi!

ReadArchived

Wiby is a search engine for older style pages, lightweight and based on a subject of interest. Building a web more reminiscent of the early internet.

ArchivedRead

Find a web page made by an IndieWeb community member.

ReadArchived

Tools

Web scraping library and command-line tool for text discovery and extraction (main content, metadata, comments) - GitHub - adbar/trafilatura: Web scraping library and command-line tool for text dis...

ReadArchived

Headless Chrome Node.js API. Contribute to puppeteer/puppeteer development by creating an account on GitHub.

ReadArchived

A standalone version of the readability lib. Contribute to mozilla/readability development by creating an account on GitHub.

ReadArchived

Lightning-fast, open source search engine for everyone

typesense, search engine, fuzzy search, typo tolerance, faceting, filtering, app search, site search, search bar, algolia, elasticsearchArchivedRead

You can install it using pip:

ArchivedRead

FastAPI framework, high performance, easy to learn, fast to code, ready for production

ArchivedRead

Google Research. Contribute to google-research/google-research development by creating an account on GitHub.

ReadArchived

A motivating factor is the search engine has sort of grown to a scale where it's becoming increasingly difficult to productively work on as a personal solo project. It needs more structure. What's kept me from open sourcing it so far has also been the need for more structure. The needs of the marginalia project, and the needs of an open source project have effectively aligned.

ArchivedRead

Specific Search & Recommendation Platforms

Blog Surf is the internet's only search engine for blogs. Explore the best writing on the internet.

ArchivedRead

An open index of well-known resources.

ReadArchived

TinyGem is a bookmarking service, that automatically uses the links you save to surface other related content from manually curated sources. If you are intelectually curious, have a selective news diet and enjoy reading places like Hacker News, TinyGem might be for you.

ReadArchived

Corpuses

Us

ArchivedRead

The HTTP Archive Tracks how the web is built by periodically crawl the top sites on the web and record detailed information about fetched resources, used web platform APIs and features, and execution traces of each page.

ArchivedRead


A guide for how to discover cool things on the internet.

ArchivedRead

Hello. I was going to write a post about how to surf the web only I remembered it had already been written, in a far more comprehensive format, by another person. So I'm just going to link to it and…

ArchivedRead

Meilisearch is neat together with their tokenizer lib they use. More practically DocSearch is great for plug and use solution. Tantivy, Quickwit & Edgesearch are interesting too.

ArchivedRead

This article is a stub. You can help the IndieWeb wiki by expanding it.

ArchivedRead

Hey nerds: I recently stumbled across “Marginalia Search”. It’s a search engine with a fascinating design — rather than give you exactly what you’re looking for, it tries to surprise you.

ArchivedRead

Indie Map is a complete crawl of 2300 of the most active IndieWeb sites as of June 2017, sliced and diced and rolled up in a few useful ways:

ArchivedRead

🍵️

ArchivedRead

The way to improve search is not to mimic Google, but instead to build boutique search engines that index, curate, and organize things in new ways.

ArchivedRead

bookmark

ArchivedRead