List of useful data augmentation resources. You will find here some not common techniques, libraries, links to GitHub repos, papers, and others.
-
Updated
Oct 31, 2023
List of useful data augmentation resources. You will find here some not common techniques, libraries, links to GitHub repos, papers, and others.
[CVPR 2020--Oral] CycleISP: Real Image Restoration via Improved Data Synthesis
Computer vision utils for Blender (generate instance annoatation, depth and 6D pose by one line code)
[CVPR 2023] Label-Free Liver Tumor Segmentation
This repository collects papers for "A Survey on Knowledge Distillation of Large Language Models". We break down KD into Knowledge Elicitation and Distillation Algorithms, and explore the Skill & Vertical Distillation of LLMs.
Coursera - RNN Programming Assignment: In this project, we will construct a speech dataset and implement an algorithm for trigger word detection (sometimes also called keyword detection, or wake word detection).
A data framework for music information retrieval focusing on electronic music.
Repository for the results of my master thesis, about the generation and evaluation of synthetic data using GANs
Apache NiFi Data Synthesizer
A data synthesizer for creating datasets of feet from a first-person perspective.
The Coastal Carbon Network Data Library: An open-source database featuring carbon data from tidal wetlands around the world
Boosting Document Intelligence
Official implementaion of EMNLP 2022 paper "Generate, Discriminate, and Contrast: A Semi-Supervised Sentence Representation Learning Framework"
Benchmarking RGBD SLAM robustness under sensor and motion perturbations.
Code & data for ICLR 2024 spotlight paper: 🍯MUSTARD: Mastering Uniform Synthesis of Theorem and Proof Data
Source code for LDPTrace: Locally Differentially Private Trajectory Synthesis. VLDB 2023.
Synthesis data in YOLO format given background and object images
Since the times of d'Alembert, Lagrange and Euler humans like to add fictitious dimensions to their real-world physical and mathematical problems. This art was perfected in the XX-th century by Heisenberg, Pauli and Dirac in their 'matrix mechanics'. In the XXI-st century we can contribute to this proud tradition too, we have computers! :)
Comprehensive reproduction of the paper "BNT162b2 mRNA Covid-19 Vaccine in a Nationwide Mass Vaccination Setting" by Noa Dagan, MD, et al., assisted by Professor Yair Goldberg. This statistical project explores vaccination's multifaceted impact on infection rates, employing synthetic data, advanced matching, and sophisticated statistical analysis.
Repository for Slide Deck and Code Examples for talk at SDP Convening 2023
Add a description, image, and links to the data-synthesis topic page so that developers can more easily learn about it.
To associate your repository with the data-synthesis topic, visit your repo's landing page and select "manage topics."