The AI Datastore for Schemas, BLOBs, and Predictions. Use with your apps or integrate built-in Human Supervision, Data Workflow, and UI Catalog to get the most value out of your AI Data.
-
Updated
Jun 11, 2024 - Python
The AI Datastore for Schemas, BLOBs, and Predictions. Use with your apps or integrate built-in Human Supervision, Data Workflow, and UI Catalog to get the most value out of your AI Data.
A list of tools for annotating data, managing annotations, etc.
🍳 Recipes for the Prodigy, our fully scriptable annotation tool
🧬 A JupyterLab extension for annotating data with Prodigy
The Wearables Development Toolkit - a development environment for activity recognition applications with sensor signals
Social Media Mining Toolkit (SMMT) main repository
An AI-driven solution for enhancing safety at construction sites. Utilises YOLOv8 for object detection to identify overhead hazards like heavy loads and steel pipes. Alerts are triggered if personnel are detected beneath these hazards. Dataset sourced from Taiwan's construction industry.
🔥 One of the most comprehensive open-source data annotation platform.
Tornado is an open source Human-in-the-loop machine learning tool. It helps you label your dataset on the fly while training your model through a simple web user interface. It supports all data types: structured, text and image.
A system for prompted weak supervision.
PersianDataAnnotations is ASP.NET Core MVC & ASP.NET MVC Custom Localization DataAnnotations (Localized MVC Errors) for Persian(Farsi) language - فارسی سازی خطاهای اعتبارسنجی توکار ام.وی.سی. و کور.ام.وی.سی. برای نمایش اعتبار سنجی سمت کلاینت
Data-centric AI building blocks for computer vision applications
Visualization and Annotation Tool for ROS
🧬 A VS Code extension for annotating data with Prodigy
This is a tool to annotate the focus plane of z-stacked images.
AnnoTheia is a data annotation toolkit that identifies when a person speaks in a scene and transcribes their speech, also offering flexibility to replace modules for different languages.
Use Large Language Models like OpenAI's GPT-3.5 for data annotation and model enhancement. This framework combines human expertise with LLMs, employs Iterative Active Learning for continuous improvement, and integrates CleanLab (Confident Learning) to ensure high-quality datasets and better model performance
Jaehyung Kim et al's ACL 2023 paper on "infoVerse: A Universal Framework for Dataset Characterization with Multidimensional Meta-information"
A PointRCNN version of SAnE, which is a web-based semi-automatic annotation tool for point cloud data.
Simple Telegram bot to annotate and varify automatic speech recognition datasets
Add a description, image, and links to the data-annotation topic page so that developers can more easily learn about it.
To associate your repository with the data-annotation topic, visit your repo's landing page and select "manage topics."