Understanding Indie Search Engines

Indie Search Engines

What is Teclis?

ArchivedRead

Marginala

Kagi is a privacy-focused, user-centric search engine. Great search experience starts with Kagi!

ReadArchived

Wiby is a search engine for older style pages, lightweight and based on a subject of interest. Building a web more reminiscent of the early internet.

ArchivedRead

Find a web page made by an IndieWeb community member.

ReadArchived

At Mojeek we like to do things differently, that's why we're building a search engine that respects your privacy whilst providing unique and unbiased results.

ArchivedRead

Ecosia uses the ad revenue from your searches to plant trees where they are needed the most. By searching with Ecosia, you’re not only reforesting our planet, but you’re also empowering the communities around our planting projects to build a better future for themselves. Give it a try!

ecosia, green, search, engineArchivedRead

Tools

Web scraping library and command-line tool for text discovery and extraction (main content, metadata, comments) - GitHub - adbar/trafilatura: Web scraping library and command-line tool for text dis...

ReadArchived

Headless Chrome Node.js API. Contribute to puppeteer/puppeteer development by creating an account on GitHub.

ReadArchived

A standalone version of the readability lib. Contribute to mozilla/readability development by creating an account on GitHub.

ReadArchived

Lightning-fast, open source search engine for everyone

typesense, search engine, fuzzy search, typo tolerance, faceting, filtering, app search, site search, search bar, algolia, elasticsearchArchivedRead

You can install it using pip:

ArchivedRead

FastAPI framework, high performance, easy to learn, fast to code, ready for production

ArchivedRead

Google Research. Contribute to google-research/google-research development by creating an account on GitHub.

ReadArchived

A motivating factor is the search engine has sort of grown to a scale where it's becoming increasingly difficult to productively work on as a personal solo project. It needs more structure. What's kept me from open sourcing it so far has also been the need for more structure. The needs of the marginalia project, and the needs of an open source project have effectively aligned.

ArchivedRead

Specific Search & Recommendation Platforms

Blog Surf is the internet's only search engine for blogs. Explore the best writing on the internet.

ArchivedRead

An open index of well-known resources.

ReadArchived

TinyGem is a bookmarking service, that automatically uses the links you save to surface other related content from manually curated sources. If you are intelectually curious, have a selective news diet and enjoy reading places like Hacker News, TinyGem might be for you.

ReadArchived

Corpuses

Us

ArchivedRead

The HTTP Archive Tracks how the web is built by periodically crawl the top sites on the web and record detailed information about fetched resources, used web platform APIs and features, and execution traces of each page.

ArchivedRead

Crawl Techniques

Stealth mode: Applies various techniques to make detection of headless puppeteer harder.. Latest version: 2.11.1, last published: 3 months ago. Start using puppeteer-extra-plugin-stealth in your project by running `npm i puppeteer-extra-plugin-stealth`. There are 334 other projects in the npm registry using puppeteer-extra-plugin-stealth.

puppeteer, puppeteer-extra, puppeteer-extra-plugin, stealth, stealth-mode, detection-evasion, crawler, chrome, headless, pupeteerArchivedRead

I want to share lists of links, but make them readable and archived

posts, projects, 11ty, Node, WiP, fetch, Context PagesArchivedRead

Other languages:

ArchivedRead


A guide for how to discover cool things on the internet.

ArchivedRead

Hello. I was going to write a post about how to surf the web only I remembered it had already been written, in a far more comprehensive format, by another person. So I'm just going to link to it and…

ArchivedRead

Meilisearch is neat together with their tokenizer lib they use. More practically DocSearch is great for plug and use solution. Tantivy, Quickwit & Edgesearch are interesting too.

ArchivedRead

This article is a stub. You can help the IndieWeb wiki by expanding it.

ArchivedRead

Hey nerds: I recently stumbled across “Marginalia Search”. It’s a search engine with a fascinating design — rather than give you exactly what you’re looking for, it tries to surprise you.

ArchivedRead

Indie Map is a complete crawl of 2300 of the most active IndieWeb sites as of June 2017, sliced and diced and rolled up in a few useful ways:

ArchivedRead

🍵️

ArchivedRead

The way to improve search is not to mimic Google, but instead to build boutique search engines that index, curate, and organize things in new ways.

ArchivedRead

bookmark

ArchivedRead

The source code and instructions to create your own version of Wiby.

ArchivedRead

Kyle Chayka writes about the evolution of Google Search, which has become the runaway favorite Internet search engine despite many users’ misgivings about how the company monetizes the data it collects and how its algorithms determine the search results that a user is shown.

infinite scroll, new yorker favorites, google, search engines, internet, digital technology, algorithmic bias, textbelowcenterfullbleednocontributor, web, tagsArchivedRead