Data Products (3,898)
Nexdata provides off-the-shelf face anti-spoofing data covering 2D/3D liveness detection, infrared face, gait recognition, re-id, lip language, and OCR. The dataset includes 200,000 IDs with a diverse population distribution, collected in various environments with over 97% accuracy.
Training One-off Purchase Raw Data Real-timeRAG Compatibility 3.5/5
Nexdata provides high-quality training data and annotation services for Natural Language Processing (NLP) tasks such as supervised fine-tuning (SFT), reinforcement learning from human feedback (RLHF), and red teaming services. It covers 141 countries and offers 5 years of historical data.
Inference One-off Purchase API MonthlyRAG Compatibility 4.5/5
Nexdata offers off-the-shelf gesture recognition data with 10,000 IDs, covering scenes like conference, in-car, and home. The data includes diverse gestures, race, gender, and age distributions collected from indoor office, in-car, and conference environments.
Inference API Real-time MonthlyRAG Compatibility 4.3/5
Nexdata provides high-quality Annotated Imagery Data annotation services for bounding box, polygon, segmentation, polyline, key points, image classification, and description, with historical data coverage of 5 years across 145 countries.
Inference One-off Purchase API MonthlyRAG Compatibility 4.5/5
Nexdata offers a versatile collection of unlabeled text data, NLP data, multilingual parallel corpus, and annotated imagery data. The dataset includes 800 TB of data and provides 5 years of historical coverage across 90 countries.
Inference One-off Purchase API Raw Data MonthlyRAG Compatibility 4.5/5
Nexdata Lip Multimodal Data consists of 2,000 IDs of audio and image data collected from various angles and scenes using cellphones. The dataset offers diverse annotated imagery data with 95% accuracy, covering 93 countries and available in various formats including .bin, .json, and .xml.
Training One-off Purchase API Raw Data Real-timeRAG Compatibility 3.5/5
Nexdata offers off-the-shelf human body data with behavior recognition, segmentation, key points tracking, and more. The dataset includes 300,000 IDs covering Asians, Caucasians, and black people across different age ranges and environments in 140 countries.
Inference API MonthlyRAG Compatibility 4.6/5
Nexdata provides a multilingual parallel corpus dataset with 200 million pairs of text data for AI & ML training, translation, and natural language processing. The dataset covers various fields like spoken language, traveling, medical treatment, news, and finance.
Inference API StaticRAG Compatibility 4.5/5
Nexdata offers a comprehensive dataset containing read speech data in over 100 languages, totaling 65,000 hours. The data is collected from native speakers and covers various topics such as economics, entertainment, news, and more, with a focus on AI and ML training.
Inference API Raw Data DailyRAG Compatibility 4.5/5
Nexdata offers 5,000 hours of multilingual code-switching speech data for audio AI & ML training and natural language processing (NLP) purposes. The data includes a mix of multi-language sentences in various scenarios and settings, with high accuracy.
Inference API Daily MonthlyRAG Compatibility 4.3/5