Hacker News
- The Promise of Synthetic Data https://news.mit.edu/2020/real-promise-synthetic-data-1016 4 comments
- Synthetic data generation for tabular data https://github.com/sdv-dev/SDV 12 comments
- _synthesize, the Developer Conference for Synthetic Data https://gretel.ai/synthesize2023 2 comments
- Generate Synthetic Data in 3 Lines of Code https://gretel.ai/blog/generate-synthetic-data-in-3-lines-of-code 16 comments
- Using Llama3.1 405B to generate political synthetic data https://www.oxen.ai/Laurence/political-spam/file/main/texts.parquet?query_id=f6bbb123-1453-4e02-a477-4bebdc379b0e 3 comments
- Synthetic Data from Diffusion Models Improves ImageNet Classification https://arxiv.org/abs/2304.08466 60 comments
- Show HN: I used GPT-3 to make synthetic training data https://www.ameliormate.com/gpt3-synthetic-data 2 comments
- Show HN: Trying to pollute unethical America PAC's database with synthetic data https://github.com/amarcheschi/SPAMerica-PAC 2 comments
- Q* Hypothesis: Enhancing Reasoning, Rewards, and Synthetic Data https://www.interconnects.ai/p/q-star 63 comments
- 'Nemotron-4 340B' model redefines synthetic data generation, rivals GPT-4 https://venturebeat.com/ai/nvidias-nemotron-4-340b-model-redefines-synthetic-data-generation-rivals-gpt-4/ 5 comments
- Show HN: Curator – an open-source library for synthetic data generation https://github.com/bespokelabsai/curator 6 comments
- Show HN: Kiln - Interactive LLM fine-tuning, dataset collab & synthetic data gen https://github.com/Kiln-AI/Kiln/blob/main/guides/Fine%20Tuning%20LLM%20Models%20Guide.md 2 comments
- Nvidia Bets Big on Synthetic Data https://www.wired.com/story/nvidia-gretel-acquisition-synthetic-training-data/ 5 comments technews
- Census Bureau's use of 'synthetic data' worries researchers https://apnews.com/article/census-2020-technology-data-privacy-business-be938fa5db887a0ae6858dff0be217ef 20 comments politics
- Census Bureau's use of 'synthetic data' worries researchers https://apnews.com/article/census-2020-technology-data-privacy-business-be938fa5db887a0ae6858dff0be217ef 12 comments politics
- Synthetic monitoring of APIs with distributed trace data? https://github.com/kubeshop/tracetest 4 comments sre
- Announcing synthesize 2023, the developer conference for synthetic data https://gretel.ai/synthesize2023 2 comments learnmachinelearning
- Announcing synthesize 2023, the developer conference for synthetic data https://gretel.ai/synthesize2023 2 comments artificial
- Synthetic Data Generation for Computer Vision in Blender (part 1) https://5agado.medium.com/synthetic-data-generation-for-computer-vision-in-blender-part-1-6926819b11e6 3 comments learnmachinelearning
- [D] Synthetic data for AI among the 10 Breakthrough Technologies 2022 of the MIT Tech Review https://www.technologyreview.com/2022/02/23/1044965/ai-synthetic-data-2/ 20 comments machinelearning
- Seeking feedback on a synthetic data tooling project https://www.conjure-ai.com/ 4 comments computervision
- Semantic segmentation on synthetic data (CAD images) https://drive.google.com/drive/folders/1qzUXqEx-oHKyADQ2MHUyB4Foo1HeRMl0?usp=sharing 20 comments computervision
- Generating Synthetic Data Sets with ‘synthpop’ in R https://www.r-bloggers.com/generating-synthetic-data-sets-with-synthpop-in-r/ 6 comments datascience
- New way to write DNA could turbocharge synthetic biology and data storage https://www.sciencemag.org/news/2018/10/new-way-write-dna-could-turbocharge-synthetic-biology-and-data-storage 10 comments science
- Open source Postgres data anonymization and synthetic data generation https://github.com/nucleuscloud/neosync 6 comments postgresql
- We should all be worried about synthetic data | Making up the world through made-up data https://iai.tv/articles/we-should-all-be-worried-about-synthetic-data-auid-2138&utm_source=reddit&_auid=2020 46 comments futurology
- [R] Is Model Collapse Inevitable? Breaking the Curse of Recursion by Accumulating Real and Synthetic Data https://arxiv.org/abs/2404.01413 13 comments machinelearning
- Simple synthetic data reduces sycophancy in LLMs [R] https://arxiv.org/abs/2308.03958 2 comments machinelearning
- The multi-billion-dollar potential of synthetic data https://venturebeat.com/ai/the-multi-billion-dollar-potential-of-synthetic-data/ 3 comments technology
- Open source human rights group uses synthetic data to uncover war crimes https://twitter.com/gretel_ai/status/1469415837241073673?s=21 7 comments opensource
- Interesting TechCrunch article on how deep learning with synthetic data will democratize the tech industry! https://www.reddit.com/r/Neuromation/comments/8jcncm/check_out_this_techcrunch_article_on_how_deep/ 4 comments deeplearning
- Neuromation. Distributed Synthetic Data Platform for Deep Learning Applications https://neuromation.io/ 5 comments ethereum
- [P] How we generate high-quality synthetic time-series data for one of the largest financial institutions in the world. https://gretel.ai/blog/creating-synthetic-time-series-data-for-global-financial-institutions-a-poc-deep-dive 5 comments machinelearning
- Researchers at USC reviewed data from several clinical trials. They reported that patients receiving either whole-plant cannabis containing THC or plant-derived medicines containing both THC and CBD reported more improvements in pain intensity compared to those receiving synthetic cannabinoids. https://pc.jdapm.org/DOIx.php?id=10.17245/jdapm.2021.21.6.479 16 comments science
- New synthetic antibiotic overcomes some of the most popular bacterial resistances such as MLSb cross-resistance with promising data in mice. https://www.nature.com/articles/s41586-021-04045-6 8 comments science
- Scientists present an approach to encode medical history on a patient using quantum dots in the skin, which are invisible to the naked eye yet detectable when exposed to near-infrared light, with success on rats and synthetic human skin. This may lead to decentralized data storage and biosensing. https://stm.sciencemag.org/content/11/523/eaay7162 111 comments science
- Scientists 3D-printed plastic bunnies with synthetic DNA inside that holds the blueprints to make more bunnies. The proof-of-concept showcases DNA's potential as a hyper-dense way of storing data. Theoretically, a billion gigabytes can be stored in something about the size of a few sugar cubes. https://www.discovermagazine.com/technology/this-plastic-bunny-is-filled-with-artificial-dna-the-data-inside-more 7 comments science
- In a dramatic change from 2010, the most recent data finds that fentanyl (or another synthetic opioid) is present in the majority of U.S. individuals who died of drug overdose. https://jamanetwork.com/journals/jama/fullarticle/2679931 7 comments science
- Right-to-Carry Laws and Violent Crime: A Comprehensive Assessment Using Panel Data and a State-Level Synthetic Controls Analysis http://www.nber.org/papers/w23510 10 comments politics
- Synthetic double-helix faithfully stores Shakespeare's sonnets. 'Error-free' technique encodes very large files in DNA for the first time. The method could store all of CERN's 90 petabytes of data on 41 grams of DNA http://www.nature.com/news/synthetic-double-helix-faithfully-stores-shakespeare-s-sonnets-1.12279 11 comments science