Data Products (3,897)
Swash provides fully compliant, cookie-less browsing clickstream data from 1.5 million users worldwide, encompassing demographics, user-level time-stamped raw and processed data feeds, and search and URL visits. The dataset offers insights for market intelligence, consumer behavior analysis, and strategy optimization.
Inference Monthly Raw Data Monthly RAG Compatibility 4.7/5
Swash User Search and Consumer Journey Data provides GDPR-compliant clickstream insights sourced from 1.5 million users worldwide, covering 182 countries. The data includes search queries, search results, clicked ads, next pages visited, and potential purchase activities across desktop and mobile browsing.
Inference API Monthly RAG Compatibility 4.5/5
Swash provides GDPR-compliant clickstream data from 1.5 million users worldwide, covering demographics, user behavior, and raw and processed data feeds across desktop and mobile browsing.
Inference Raw Data Monthly RAG Compatibility 4.5/5
Syntegra Synthetic Claims Data offers patient-level, synthetic Medicare claims data based on real U.S. healthcare data. The data is available in multiple formats and can be used to expand cohorts, balance populations, and develop products.
Inference Monthly One-off Purchase API Real-time Monthly RAG Compatibility 4.5/5
Syntegra Synthetic EHR Data offers synthetic patient-level data derived from U.S.-based hospital EHR systems. It provides detailed patient journey information and rich healthcare data not available elsewhere, suitable for various analytics and AI/ML model training.
Inference Monthly Annual One-off Purchase API Raw Data Real-time Quarterly RAG Compatibility 4.2/5
Ainnotate offers a synthetic dataset for AI training, generated using large-scale generative modeling and domain randomization. The dataset covers various domains such as internal services, financial services, and healthcare, providing statistically significant data for algorithm training.
Training API Raw Data Monthly RAG Compatibility 3.5/5
The Synthetic Document AI Dataset by Ainnotate offers 10,000 images in JPEG, PNG, and PDF formats for training AI algorithms. The dataset is designed to reflect real-world variables and to provide statistically significant data for simulation and model training.
Training Pay Per Use API Raw Data RAG Compatibility 2.0/5
Mirage offers synthetic image data for computer vision models, with perfectly annotated bounding boxes, segmentation, keypoints, depth, and normals. The dataset covers 249 countries and is generated with 3D game engines, providing error-free data for various applications.
Fine-tuning Raw Data Monthly RAG Compatibility 3.5/5
The Synthpop Dataset offers a curated selection of audio tracks with detailed metadata, tailored for innovative machine learning applications focusing on the Synthpop genre's distinct sound. It includes chords, instrumentation, key, tempo, and timestamp information.
Inference One-off Purchase API Monthly RAG Compatibility 4.5/5
The Synthwave Dataset is a curated collection geared towards advanced machine learning applications. It consists of audio tracks enriched with metadata like chords, instruments, key signatures, tempo, and timestamps. This dataset uniquely blends intricate musical data with the nostalgic electronic sounds of the 1980s genre, offering a specialized resource for training models in generative AI music, Music Information Retrieval (MIR), and source separation.
Inference One-off Purchase Raw Data Monthly RAG Compatibility 4.5/5