Machine learning requires large amounts of data to develop predictive models of the world. Help foster the development of machine learning by sharing or selling datasets you have developed or acquire new datasets to jumpstart your next project.
Collection of articles from the category of Counterfeit Money including. Unique user_id, views, category, title, keywords, content, created_at, updated_at
Collection of articles from the category of Contacting Financial Companies including. Unique user_id, views, category, title, keywords, content, created_at, updated_at
Collection of articles from the category of Checks and Checkbooks including. Unique user_id, views, category, title, keywords, content, created_at, updated_at
Legal Clause Classification Dataset built from various sources like multiple contracts, online contract texts etc and Label them into 24 categories. The sole purpose of this dataset is to identify any given contract text into one of the Clause labels. Although the pre-defined categories can be customized according to the user requirements and same goes for the dataset contract text.
www.predictly.co
Twitch stream data collected from ~2500 popular Twitch streamers over 4 months (9/24/2020 to 2/05/2021). Real time data for live streamers updated approx. every 5 minutes. Dataset includes current timestamp, streamer name, stream title, game_id, stream start time, and viewership count. Contains 7,936,251 live stream data instances.
This is a data set having tweets along with the user name who tweeted that with the time of tweet creation.Here whole dataset is unique.
To perform sentimental analysis and hate speech recognition this data set is ready to use database.