Skip to content
View adbar's full-sized avatar

Organizations

@deutschestextarchiv

Block or report adbar

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

A minimalist text editor that lives in URL

HTML 1,386 100 Updated Jan 5, 2026

Enhancing Cross-Lingual Transfer through Reversible Transliteration: A Huffman-Based Approach for Low-Resource Languages (ACL 2025)

Python 5 Updated Aug 12, 2025

Scalable data pre processing and curation toolkit for LLMs

Python 1,331 205 Updated Jan 5, 2026

BirdNET analyzer for scientific audio data processing.

Python 1,349 235 Updated Dec 18, 2025

Identify bird sounds in real time with this Android version of BirdNET. Bird sound recognition for more than 6,000 species worldwide.

Kotlin 759 41 Updated Dec 7, 2025

Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-of-use, backed by research.

Python 3,809 262 Updated May 17, 2025

AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data…

MDX 23,804 2,545 Updated Jan 5, 2026

Next-generation Punkt sentence boundary detection with zero dependencies

Python 26 1 Updated Nov 18, 2025

Visualize Different Text Splitting Methods

JavaScript 316 51 Updated Jan 2, 2025

Sample code for deep learning & neural networks

Python 201 60 Updated May 1, 2025

Financial data platform for analysts, quants and AI agents.

Python 57,353 5,555 Updated Jan 5, 2026

🔢 Work with static vector models

Python 36 Updated Apr 21, 2025

Fast lexical search implementing BM25 in Python using Numpy, Numba and Scipy

Python 1,447 83 Updated Dec 22, 2025

Kowalski, analysis

Go 10 Updated Feb 19, 2025

Convert news articles, blog posts (and more) into audio podcast episodes using natural-sounding AI text-to-speech models

SCSS 4 Updated Nov 19, 2025

An extremely fast Python linter and code formatter, written in Rust.

Rust 44,967 1,684 Updated Jan 6, 2026

A bridge between Lichess bots and chess engines

Python 955 526 Updated Dec 31, 2025

Curated list of datasets and tools for post-training.

4,144 337 Updated Nov 10, 2025

Feature set analysis for chess NNUE networks

Rust 7 Updated Dec 1, 2024

Play chess via GitHub

1,108 177 Updated Jan 6, 2026

Sunfish: a Python Chess Engine in 111 lines of code

Python 3,174 572 Updated May 17, 2025

A chess library for Python, with move generation and validation, PGN parsing and writing, Polyglot opening book reading, Gaviota tablebase probing, Syzygy tablebase probing, and UCI/XBoard engine c…

Python 2,725 556 Updated Jan 3, 2026

Python bindings for Ada URL parser

C++ 64 7 Updated Oct 1, 2025

WHATWG-compliant and fast URL parser written in modern C++, part of Internet Archive, Node.js, Clickhouse, Redpanda, Kong, Telegram, Adguard, Datadog and Cloudflare Workers.

C++ 1,666 117 Updated Jan 5, 2026

List of libraries, tools and APIs for web scraping and data processing.

Makefile 7,714 852 Updated Oct 13, 2025

Build a RAG dataset for your domain in just a few lines of codes, using your XML sitemap

Python 48 3 Updated Aug 24, 2024

Chatmail Rust Core library, used by Android/iOS/desktop chatmail apps, bindings and bots 📧

Rust 816 117 Updated Jan 5, 2026

🕸 GlotWeb: Web Indexing for Low-Resource Languages -- under construction.

Python 17 Updated Aug 13, 2025

An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.

Python 27,769 2,526 Updated Sep 30, 2025
Next