Discovering groups of similar Twitter users based on their profiles.
Online social media. You may love them, you may hate them, but the truth is, they have become an integral part of the modern life and they are not going to disappear anytime soon. In fact, it’s likely that this year’s situation1 has made them even more popular, as they provide a way to stay connected with friends and the world without having to step outside.
Social media platforms allow people to consume and share information in real-time, thereby serving a wide range of purposes – from sharing personal thoughts and experience to connecting customers with businesses. Twitter is arguably the biggest beast of them all. In 2019, Twitter reported 330M monthly active users (that’s roughly equivalent to the population of the US!), and 145M daily active users. Twitter profiles can be run by a variety of user types. Besides typical (human) users, we may encounter professional accounts, bussiness accounts or, unfortunately, spammers.
Having a random data sample of users from a Twitter study at hand, we decided to use it to try finding whether we can discover groups of users with similar characteristics, where characteristics is defined based on their Twitter activity.
Why is that useful? There are many reasons why – to name a few:
After some research on Twitter’s user base we conjectured that the data could reveal the following user categories:
There are of course many profiles that don’t fit any of the above descriptions, however we estimate that these categories should account for at least 80% of Twitter’s user base.
Continue to Analysis.