souvenir
Tag cloud
Picture wall
Daily
RSS Feed
  • RSS Feed
  • Daily Feed
Filters

Links per page

  • 20 links
  • 50 links
  • 100 links

Filters

Untagged links
page 1 / 2
35 results tagged data  ✕   ✕
Dolt https://github.com/dolthub/dolt
04/08/2025 cluster icon
  • re:dash : Query Editor: think of JS Fiddle for SQL queries. It's your way to share data in the organization in an open way, by sharing both the dataset and the ...
  • Dolt : Dolt is a relational database, i.e. it has tables, and you can execute SQL queries against those tables. It also has version control primitives that o...
  • Splitgraph : Splitgraph is a data management, building and sharing tool inspired by Docker and Git that works on top of PostgreSQL and integrates seamlessly with a...
  • FillDB : FillDB is a free tool that lets you quickly generate large volumes of custom data in MySql format to use in testing software and populating databases ...
  • VersionPress : VersionPress is a free and open source version control plugin for WordPress built on Git. You can: Undo changes Create staging sites Merge databases ...

Dolt is a SQL database that you can fork, clone, branch, merge, push and pull just like a Git repository.

Connect to Dolt just like any MySQL database to read or modify schema and data. Version control functionality is exposed in SQL via system tables, functions, and procedures.

Or, use the Git-like command line interface to import CSV files, commit your changes, push them to a remote, or merge your teammate's changes. All the commands you know for Git work exactly the same for Dolt.

data database versioncontrol version sql mysql git
Grist https://www.getgrist.com/
25/07/2025 cluster icon
  • Appwrite : Appwrite is an end-to-end backend server for Web, Mobile, Native, or Backend apps packaged as a set of Docker microservices. Appwrite abstracts the co...
  • FillDB : FillDB is a free tool that lets you quickly generate large volumes of custom data in MySql format to use in testing software and populating databases ...
  • Beehive : Beehive is an event and agent system, which allows you to create your own agents that perform automated tasks triggered by events and filters. It is m...
  • ProjectPier.org : ProjectPier is a Free, Open-Source, PHP application for managing tasks, projects and teams through an intuitive web interface. It must be downloaded a...
  • Tiny Tiny RSS : Tiny Tiny RSS is a free and open source web-based news feed (RSS/Atom) reader and aggregator.

Grist is a modern relational spreadsheet. It combines the flexibility of a spreadsheet with the robustness of a database.

spreadsheet data database opensource software
dbt https://github.com/dbt-labs/dbt-core
04/07/2025 cluster icon
  • pgLoader : pgloader loads data into PostgreSQL and allows you to implement Continuous Migration from your current database to PostgreSQL. pgLoader has two modes ...
  • Kaitai Struct : A new way to develop parsers for binary structures. Declarative: describe the very structure of the data, not how you read or write it
  • pg_chameleon : pg_chameleon is a MySQL to PostgreSQL replica system written in Python 3. The tool can connect to the mysql replication protocol and replicate the dat...
  • mirador : Mirador is a tool for visual exploration of complex datasets. It enables users to discover correlation patterns and derive new hypotheses from the dat...
  • CyberChef : A simple, intuitive web app for analysing and decoding data without having to deal with complex tools or programming languages. CyberChef encourages b...

dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications.

data etl tool
Kestra https://github.com/kestra-io/kestra
29/01/2025 cluster icon
  • sim : Open-source platform to build and deploy AI agent workflows.
  • Airflow : Airflow is a platform created by the community to programmatically author, schedule and monitor workflows.
  • Flyte : Flyte is an open-source, container-native, structured programming and distributed processing platform implemented in Golang. It enables highly concurr...
  • Beehive : Beehive is an event and agent system, which allows you to create your own agents that perform automated tasks triggered by events and filters. It is m...
  • n8n.io : n8n (pronounced nodemation) helps you to interconnect each and every app with an API in the world with each other to share and manipulate its data wit...

Kestra is an open-source, event-driven orchestration platform that makes both scheduled and event-driven workflows easy. By bringing Infrastructure as Code best practices to data, process, and microservice orchestration, you can build reliable workflows directly from the UI in just a few lines of YAML.

workflow automation platform opensource scheduler events data
Streamlit https://streamlit.io/
05/03/2023 cluster icon
  • Pyramid : Pyramid is a very general open source Python web framework.
  • Flask : Flask is a microframework for Python based on Werkzeug, Jinja 2 and good intentions. And before you ask: It's BSD licensed!
  • Pinax : Pinax is an open-source platform built on the Django Web Framework. By integrating numerous reusable Django apps and providing starter projects and in...
  • Chainer : Chainer is a Python-based deep learning framework aiming at flexibility. It provides automatic differentiation APIs based on the define-by-run approac...
  • Roll : Roll is a pico framework with performances and aesthetic in mind.

Streamlit is an open source app framework in Python language. It helps us create web apps for data science and machine learning in a short time.

python framework web data machinelearning
Flyte https://flyte.org/
27/03/2022 cluster icon
  • Kestra : Kestra is an open-source, event-driven orchestration platform that makes both scheduled and event-driven workflows easy. By bringing Infrastructure as...
  • jina : Jina is a neural search framework that allows to build deep learning search applications in minutes. It provides scalable indexing, querying, understa...
  • Fider : Fider is an open platform to collect and organize customer feedback. By using the platform, your community can share, vote and discuss on ideas they h...
  • dotmesh : Dotmesh captures snapshots from stateful Docker and Kubernetes applications as ’datadots‘, and gives you the ability to label, store and share them as...
  • Ponzu : Ponzu is a powerful and efficient open-source HTTP server framework and CMS. It provides automatic, free, and secure HTTP/2 over TLS (certificates obt...
thumbnail

Flyte is an open-source, container-native, structured programming and distributed processing platform implemented in Golang. It enables highly concurrent, scalable and maintainable workflows for machine learning and data processing.

opensource data machinelearning platform go
FillDB http://filldb.info/
31/12/2021 cluster icon
  • Dolt : Dolt is a SQL database that you can fork, clone, branch, merge, push and pull just like a Git repository. Connect to Dolt just like any MySQL database...
  • pg_chameleon : pg_chameleon is a MySQL to PostgreSQL replica system written in Python 3. The tool can connect to the mysql replication protocol and replicate the dat...
  • re:dash : Query Editor: think of JS Fiddle for SQL queries. It's your way to share data in the organization in an open way, by sharing both the dataset and the ...
  • Metabase : Metabase is the easy, open source way for everyone in your company to ask questions and learn from data.
  • MySQLTuner : MySQLTuner is a script written in Perl that will assist you with your MySQL configuration and make recommendations for increased performance and stabi...
thumbnail

FillDB is a free tool that lets you quickly generate large volumes of custom data in MySql format to use in testing software and populating databases with random data.

database mysql testing data
pydantic https://pydantic-docs.helpmanual.io/
07/11/2021 cluster icon
  • docopt : Command-line interface description language. docopt helps you define interface for your command-line app, and automatically generate parser for it.
  • Data Science Toolkit : A collection of the best open data sets and open-source tools for data science, wrapped in an easy-to-use REST/JSON API with command line, Python and ...
  • FoxDot : FoxDot is an easy-to-use Python library that creates an interactive programming environment and talks to the powerful sound synthesis engine, called S...
  • Kaitai Struct : A new way to develop parsers for binary structures. Declarative: describe the very structure of the data, not how you read or write it
  • Kartograph : The Kartograph map generator has just one method, which is there to generate SVG maps (surprise).

Data validation and settings management using python type annotations.

python data validation library
jina https://github.com/jina-ai/jina
01/10/2021 cluster icon
  • Weaviate : Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with structured filteri...
  • MeiliSearch : MeiliSearch is a powerful, fast, open-source, easy to use and deploy search engine. Both searching and indexing are highly customizable. Features such...
  • Flyte : Flyte is an open-source, container-native, structured programming and distributed processing platform implemented in Golang. It enables highly concurr...
  • Memex : Browser Extension to full-text search your browsing history & bookmarks.
  • Searx : Searx is a free internet metasearch engine which aggregates results from more than 70 search services. Users are neither tracked nor profiled. Additio...
thumbnail

Jina is a neural search framework that allows to build deep learning search applications in minutes. It provides scalable indexing, querying, understanding of any data: video, image, long/short text, music, source code, PDF, etc.

opensource search machinelearning search indexing data
Superset https://superset.apache.org/
13/04/2021 cluster icon
  • Grafana : Grafana provides a powerful and elegant way to create, explore, and share dashboards and data with your team and the world. Grafana is most commonly u...
  • Superset : Superset is a data exploration platform designed to be visual, intuitive and interactive. Superset's main goal is to make it easy to slice, dice and v...
  • re:dash : Query Editor: think of JS Fiddle for SQL queries. It's your way to share data in the organization in an open way, by sharing both the dataset and the ...
  • Metabase : Metabase is the easy, open source way for everyone in your company to ask questions and learn from data.
  • Kestra : Kestra is an open-source, event-driven orchestration platform that makes both scheduled and event-driven workflows easy. By bringing Infrastructure as...

Apache Superset is a modern data exploration and visualization platform

visualization data dashboard opensource
Splitgraph https://www.splitgraph.com/
25/07/2020 cluster icon
  • Dolt : Dolt is a SQL database that you can fork, clone, branch, merge, push and pull just like a Git repository. Connect to Dolt just like any MySQL database...
  • re:dash : Query Editor: think of JS Fiddle for SQL queries. It's your way to share data in the organization in an open way, by sharing both the dataset and the ...
  • pg_chameleon : pg_chameleon is a MySQL to PostgreSQL replica system written in Python 3. The tool can connect to the mysql replication protocol and replicate the dat...
  • Dolt : Dolt is a relational database, i.e. it has tables, and you can execute SQL queries against those tables. It also has version control primitives that o...
  • pgLoader : pgloader loads data into PostgreSQL and allows you to implement Continuous Migration from your current database to PostgreSQL. pgLoader has two modes ...
thumbnail

Splitgraph is a data management, building and sharing tool inspired by Docker and Git that works on top of PostgreSQL and integrates seamlessly with anything that uses PostgreSQL.

Splitgraph allows the user to manipulate data images (snapshots of SQL tables at a given point in time) as if they were code repositories by versioning, pushing and pulling them.

data versioncontrol sql database tool
Dolt https://github.com/liquidata-inc/dolt
07/04/2020 cluster icon
  • Dolt : Dolt is a SQL database that you can fork, clone, branch, merge, push and pull just like a Git repository. Connect to Dolt just like any MySQL database...
  • Splitgraph : Splitgraph is a data management, building and sharing tool inspired by Docker and Git that works on top of PostgreSQL and integrates seamlessly with a...
  • dotmesh : Dotmesh captures snapshots from stateful Docker and Kubernetes applications as ’datadots‘, and gives you the ability to label, store and share them as...
  • Metabase : Metabase is the easy, open source way for everyone in your company to ask questions and learn from data.
  • VersionPress : VersionPress is a free and open source version control plugin for WordPress built on Git. You can: Undo changes Create staging sites Merge databases ...
thumbnail

Dolt is a relational database, i.e. it has tables, and you can execute SQL queries against those tables. It also has version control primitives that operate at the level of table cell. Thus Dolt is a database that supports fine grained value-wise version control, where all changes to data and schema are stored in commit log.

data database versioncontrol version
Presidio https://github.com/microsoft/presidio
08/09/2019 cluster icon
  • pg_chameleon : pg_chameleon is a MySQL to PostgreSQL replica system written in Python 3. The tool can connect to the mysql replication protocol and replicate the dat...
  • pgLoader : pgloader loads data into PostgreSQL and allows you to implement Continuous Migration from your current database to PostgreSQL. pgLoader has two modes ...
  • CyberChef : A simple, intuitive web app for analysing and decoding data without having to deal with complex tools or programming languages. CyberChef encourages b...
  • dbt : dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications.
  • Splitgraph : Splitgraph is a data management, building and sharing tool inspired by Docker and Git that works on top of PostgreSQL and integrates seamlessly with a...
thumbnail

Context aware, pluggable and customizable data protection and PII data anonymization service for text and images

anonymity tool data
librehosters https://libreho.st/
20/07/2019 cluster icon
  • Dolt : Dolt is a SQL database that you can fork, clone, branch, merge, push and pull just like a Git repository. Connect to Dolt just like any MySQL database...
  • mirador : Mirador is a tool for visual exploration of complex datasets. It enables users to discover correlation patterns and derive new hypotheses from the dat...
  • FillDB : FillDB is a free tool that lets you quickly generate large volumes of custom data in MySql format to use in testing software and populating databases ...
  • Grist : Grist is a modern relational spreadsheet. It combines the flexibility of a spreadsheet with the robustness of a database.
  • Celery : Celery is an asynchronous task queue/job queue based on distributed message passing. It is focused on real-time operation, but supports scheduling as ...

librehosters is a network of cooperation and solidarity that uses free software to encourage decentralisation through federation and distributed platforms. Our values connect transparency, fairness and privacy with a culture of data portability and public contributions to the commons.

freesoftware coop federation distributed data
Kaitai Struct http://kaitai.io/
16/06/2019 cluster icon
  • RAWGraphs : RAW is an open web tool to create custom vector-based visualizations on top of the amazing d3.js library. Primarily conceived as a tool for designers ...
  • pg_chameleon : pg_chameleon is a MySQL to PostgreSQL replica system written in Python 3. The tool can connect to the mysql replication protocol and replicate the dat...
  • pgLoader : pgloader loads data into PostgreSQL and allows you to implement Continuous Migration from your current database to PostgreSQL. pgLoader has two modes ...
  • pydantic : Data validation and settings management using python type annotations.
  • mirador : Mirador is a tool for visual exploration of complex datasets. It enables users to discover correlation patterns and derive new hypotheses from the dat...

A new way to develop parsers for binary structures.

Declarative: describe the very structure of the data, not how you read or write it

binary library tool data
source{d} Engine https://sourced.tech/engine/
18/11/2018 cluster icon
  • Superset : Superset is a data exploration platform designed to be visual, intuitive and interactive. Superset's main goal is to make it easy to slice, dice and v...
  • mirador : Mirador is a tool for visual exploration of complex datasets. It enables users to discover correlation patterns and derive new hypotheses from the dat...
  • Dolt : Dolt is a SQL database that you can fork, clone, branch, merge, push and pull just like a Git repository. Connect to Dolt just like any MySQL database...
  • GeoNames : The GeoNames geographical database covers all countries and contains over eleven million placenames that are available for download free of charge.
  • librehosters : librehosters is a network of cooperation and solidarity that uses free software to encourage decentralisation through federation and distributed platf...

Engineering managers and maintainers of large code bases are starting to realize the potential of Code as Data or how source code can be treated as an analyzable dataset proving valuable information. Think Business Intelligence and processes optimization based on the source code engineers write, rather than adjacent metrics.

development data analysis staticanalysis
pg_chameleon http://www.pgchameleon.org/
02/07/2018 cluster icon
  • pgLoader : pgloader loads data into PostgreSQL and allows you to implement Continuous Migration from your current database to PostgreSQL. pgLoader has two modes ...
  • Metabase : Metabase is the easy, open source way for everyone in your company to ask questions and learn from data.
  • migra : migra is a schema comparison tool for PostgreSQL. It's a command line tool, and Python library. Find differences in database schemas as easily as runn...
  • DBeaver : Free multi-platform database tool for developers, SQL programmers, database administrators and analysts. Supports all popular databases: MySQL, Postgr...
  • re:dash : Query Editor: think of JS Fiddle for SQL queries. It's your way to share data in the organization in an open way, by sharing both the dataset and the ...

pg_chameleon is a MySQL to PostgreSQL replica system written in Python 3. The tool can connect to the mysql replication protocol and replicate the data changes in PostgreSQL. Whether the user needs to setup a permanent replica between MySQL and PostgreSQL or perform an engine migration, pg_chamaleon is the perfect tool for the job.

postgresql mysql data migration tool replication database
pgLoader https://pgloader.io/
02/07/2018 cluster icon
  • pg_chameleon : pg_chameleon is a MySQL to PostgreSQL replica system written in Python 3. The tool can connect to the mysql replication protocol and replicate the dat...
  • migra : migra is a schema comparison tool for PostgreSQL. It's a command line tool, and Python library. Find differences in database schemas as easily as runn...
  • migra.djrobstep.com : migra is a schema comparison tool for PostgreSQL. It's a command line tool, and Python library. Find differences in database schemas as easily as runn...
  • Flyway : You can think of Flyway as version control for your database. It lets you evolve your database schema easily and reliably across all your instances. F...
  • Liquibase : Liquibase is an open source database-independent library for tracking, managing and applying database schema changes

pgloader loads data into PostgreSQL and allows you to implement Continuous Migration from your current database to PostgreSQL.

pgLoader has two modes of operation. It can either load data from files, such as CSV or Fixed-File Format; or migrate a whole database to PostgreSQL.

pgLoader supports several RDBMS solutions as a migration source, and fetches information from the catalog tables over a connection to then create an equivalent schema in PostgreSQL. This means that you can migrate to PostgreSQL in a single command-line!

postgresql schema data migration tool database
Falcor http://netflix.github.io/falcor/
28/05/2018 cluster icon
  • Data Science Toolkit : A collection of the best open data sets and open-source tools for data science, wrapped in an easy-to-use REST/JSON API with command line, Python and ...
  • Quicktype : Quicktype generates types and helper code for reading JSON in C#, Swift, JavaScript, Flow, Python, TypeScript, Go, Rust, Objective-C, Kotlin, C++ and ...
  • CouchDB : Apache CouchDB™ is a database that uses JSON for documents, JavaScript for MapReduce indexes, and regular HTTP for its API
  • Neutrino : Neutrino is a companion tool which lets you build web and Node.js applications with shared presets or configurations. It intends to make the process o...
  • Chai : Chai is a BDD / TDD assertion library for node and the browser that can be delightfully paired with any javascript testing framework.
thumbnail

Falcor is the innovative data platform that powers the Netflix UIs. Falcor allows you to model all your backend data as a single Virtual JSON object on your Node server. On the client you work with your remote JSON object using familiar JavaScript operations like get, set, and call. If you know your data, you know your API.

Falcor is middleware. It is not a replacement for your application server, database, or MVC framework. Instead Falcor can be used to optimize communication between the layers of a new or existing application.

javascript data json nodejs api
Public APIs https://github.com/toddmotto/public-apis
22/04/2018 cluster icon
  • Data Science Toolkit : A collection of the best open data sets and open-source tools for data science, wrapped in an easy-to-use REST/JSON API with command line, Python and ...
  • Falcor : Falcor is the innovative data platform that powers the Netflix UIs. Falcor allows you to model all your backend data as a single Virtual JSON object o...
  • Five Filters : Explore the world of non-corporate news and analysis. Our web apps make online content more accessible
  • OpenRefine : OpenRefine is a powerful tool for working with messy data: cleaning it; transforming it from one format into another; and extending it with web servic...
  • XgeneCloud : XgeneCloud is an API Framework platform that instantly generates REST / GraphQL APIs on any SQL database. Supports MySQL, PostgreSQL, MSSQL, SQLite &...
thumbnail

A collective list of free APIs for use in web development.

free api data
page 1 / 2
1650 links
Shaarli - The personal, minimalist, super-fast, database free, bookmarking service by the Shaarli community - Theme by kalvn