Context We made an online application that helped people to diagnose the most likely disease they had based on the symptoms they provided. Before using the application the users had to enter their age group and gender and then continue forward. When a likely disease is diagnosed, the user's age group, gender and the likely diseases are updated into the dataset in a JSON file. This dataset can be used for various future predictions like which type of diseases are concentrated in certain age groups. Content Sex: Gender of the user. M or F ('M' for male and 'F' for female) Age: Age group of the users. Diagnosis: The likely diseases that were diagnosed based on the symptoms provided by the user. These diagnosis can have multiple diseases. Acknowledgements We wouldn't be here without the help of others. If you owe any attributions or thanks, include them here along with any citations of past research. Inspiration Your data will be in front of the world's largest data science community. What questions do you want to see answered?