Jump to content

Algorithmic curation

From Wikipedia, the free encyclopedia

A feed of posts curated for a user on the Mastodon social networ

Algorithm curation is the selection of online media by recommender system and personalized search. Traditionally, curation has been the work of museum and library specialists, carefully selecting relevant materials to develop collections.[1] Today, curation takes the form of selectively sharing content online, creating and maintaining a profile on any of the various social networking platforms, and searching and compiling information for reporting.[1] Curation algorithms leverage this task by implementing different filter approaches such as collaborative filtering and content-based filtering as well as builds off of existing technologies like a recommender system and personalize search. Examples include search engine and social media products such as the Twitter feed, Facebook’s News Feed, and the Google Personalized Search.[2]

History

[edit]

Early algorithmic curations

[edit]

Algorithmic curation plays an important role in how people connect online today. [3][4]There’s so much information and data we see online that deciding what to acknowledge, such as save, archive, share, or ignore, has become a lot of work.[3][4]Therefore, platforms use newsfeed algorithms to decide what to show each user.[3][4] These algorithms are complicated, so it is not easy to know how they shape communication.[3][4]

Information overload

[edit]

Early on, platforms needed a way to filter information so that users wouldn't get overwhelmed.[3] This led to the first-generation ranking algorithms showing the most recent or most popular posts.[3]Second-generation demonstrated algorithms that curate content to keep people hooked onto the platform for a longer timespan.[3]Algorithmic curation doesn't just sort content, but it also shapes knowledge, attention, and political exposure.[4] In shaping what people view on a day-to-day basis, this gives reasons to the algorithm acting as a powerful gatekeeper and deciding what new material people are exposed to.[4]

How algorithm changes users' feeds over time

[edit]

Algorithm curation increased source diversity and also reduced the number of external links, which limited access to outside information.[4] Topics based on political content were made more relevant than some COVID-19 health information, which was subdued.[4]Using agent-based modeling, researchers understand the problems within these systems.[4] They come upon the adversities and motives of the algorithms that try to increase user engagement while making misinformation and polarization worse.[4] The target is to discover how user behavior, information, and algorithms all influence each other.[4] Therefore, as a response to information overload, algorithmic curation served as a response to the massive amount of content on social media.[3][4]

Emergence of AI

[edit]

With AI driving systems, Newsfeed now predicts, personalizes, and optimizes information, which are core AI functions.[3][4] These attributes change human perception and behavior by changing what they see, share, and think.[3][4] Now, researchers have adapted to models of computational simulation to understand how AI-driven curation shapes social outcomes at scale.[3] Other platforms like Twitter have moved to simple chronological feeds.[4] They now use complex, AI-powered ranking systems that personalize information. [4]

Algorithmic curation then evolved onto AI-powered systems focused on developing, which purposefully raises polarization.[3] Now, due to advanced scales and operations, emerging research uses advanced modeling tools to keep up with these AI systems that humans cannot understand.[3] Platforms such as Twitter have moved away from a sequential feed.[4] They switched to AI-powered computational systems to compose personalized information.[3] These systems make decisions that aren't realistic for humans to make.[4]

Approach

[edit]

Filter types

[edit]
Collaborative filtering
[edit]

Collaborative filtering (CF) methods create recommendations based on a person’s usage patterns.[5] CF predicts a person’s desire for an item by matching their interests with people who have similar interests.[5] This process allows for the sharing of ratings between like-minded people.[5] CF is based on human and not machine analysis of content.[5] Users of CF systems (found commonly in social media) rate items that they have interacted with; this rating creates a profile of interests.[5] The CF system then matches that user with other people with similar interests.[5] Once matched, the ratings from those similar users are then used to generate recommendations for the user.[5] The main advantage of collaborative filtering includes the ability to filter by various types of content such as text, art, work, music, and mutual funds; it filter[s] based on complex and hard to represent concepts, such as taste and quality.[6]

Content-base filtering
[edit]

Another popular recommendation system implementation is content-based filtering (CBF). In CBF, a user profile is built to provide information about the types of items that the user likes.[6][7] This is based on keywords used to describe the items.[6][7] In this approach, a recommendation is made by presenting similar items to what the user liked in the past (or items that are similar to what the user is looking for).[7] The CBF method creates a profile for each item based on a set of discrete attributes and features.[7] The system then creates a content-based profile for the user based on a weighted vector of item features.[6] This is made from items the user has previously rated or purchased, or from items the user is currently interested in, or presently viewing.[6] The weights represent the importance of each feature to the user.[6] There are various possible ways of computing these weights, such as Bayesian Classifiers, cluster analysis, decision trees, and artificial neural networks.[6] Regardless of the calculation technique, the goal of the weighted vector is the same: to determine the probability that the user will like a suggested item.[6] One example of content-based filtering to help describe this process is Pandora Radio. When a user visits Pandora, they are prompted to enter artist, genre, or composer to create a station.[6] Pandora then uses CBF to find music with similar attributes to the song, artist, or genre that the user provides.[6]

Technology

[edit]

Recommender system

[edit]

An algorithm is curated in part by recommender systems that rank and suggest the most relevant content to users. Content is ranked according to a particular user’s implicit and explicit input.[8] Implicit rankings include elapsed time viewing or engaging with a specific item.[8] The user’s liked items (posts, store pages, articles) and shared items make up the explicit data used by algorithms to recommend content.[8]

[edit]

By utilizing external factors outside the user’s explicit query, personalized search aims to retrieve results most relevant to the user. The user’s past queries, history, and interests create an additional context that refines the algorithm’s output.[9] [new] Platforms such as X (formerly Twitter) and Bluesky give users recommendations from similar users and the content they interact with.[10] Additionally, personalized search offers users to explicitly filter search results by giving them the option to block content containing certain phrases or hashtags from being recommended.[11] Other types of commercial websites such as search engines and retailers use similar processes to prescribe tailored information and products to a distinct user.

Media

[edit]

Social media

[edit]

Algorithmic curation in social media boils down to automated systems that pick, rank, and serve content based on how users interact and what they seem to prefer.[12] Major platforms like Facebook and Twitter lean on these algorithms to craft personalized feeds, trying to boost engagement by tracking behaviors such as likes, shares, and comments.[12] For example, Facebook's News Feed pushes content that matches users' interests, which can lock in their existing civic views through selective exposure. While this does keep users around longer, it also breeds filter bubbles where they only see familiar takes, further polarizing them.[12]

Game theory research can demonstrate how curation algorithms shape network connections and content quality as complements rather than as independent attributes.[13] Users become less picky and forge more ties, but creators lose out to increased competition.[14] Some algorithms, especially those cheap to scale, crank up polarization and feed those bubbles. Others, focused on vertical quality, might ease the problem by mixing in diverse voices.[14] Music streaming platforms run into similar issues: even with personalization, users find algorithmic recommendations a bit cold, which shapes how they listen and what they discover.[15]

As misinformation spread becomes easier, divergent perspectives get harder to find.[12] Empirical evidence shows how users actively filter out opposing views by blocking content, a habit that works with algorithms to cement echo chambers.[16] Regulatory fixes like modifying systems to reduce sensationalism can scatter clickbait, but they would also expose these systems more than would be desired.[14] Current scholarship maintains focus on ethical designs that balance engagement with genuine diversity in information.[14][16]

AI's contribution

[edit]

Artificial intelligence now plays a far more prominent role in algorithmic curation, drawing on machine-learning models that can handle vast stores of data to shape content dynamically.[17] Methods such as deep learning and reinforcement learning allow these systems to anticipate user preferences with greater precision than older techniques, making it possible for platforms to adjust what users see almost instantaneously.[17] In the context of social media and streaming services, this means AI arranges feeds to highlight what it deems relevant and carries bias from the training data.[17]

A game-theory framework illustrates how AI-driven curation on platforms encourages clickbait production.[13] When users interact in a competitive environment where algorithms reward engagement-maximizing content, they are highly incentivized to get as much attention as possible through any means.[18] Another study on music consumption differentiates human roles and algorithmic ones, demonstrating that while AI recommendations reduce diversity by increasing repetition, hybrid approaches yield better outcomes.[18]

Ethical concerns range from agency hacking to wider issues of polarization and discrimination within systems.[17] A normative framework evaluates AI curation at both individual levels, emphasizing relevance and accuracy, and broader societal levels, urging governance that protects values such as civic health.[17] As AI moves further into generative forms, combined models that pair computational speed with human judgment hold promise for greater diversity, but they also require transparency and regulatory measures to keep associated risks in check.[17]

See also

[edit]

References

[edit]
  1. ^ a b Khan, Sadia; Bhatt, Ibrar (2018), "Curation", The International Encyclopedia of Media Literacy, John Wiley & Sons, Ltd, pp. 1–9, doi:10.1002/9781118978238.ieml0047, ISBN 978-1-118-97823-8, retrieved November 21, 2025
  2. ^ Berman, Ron; Katona, Zsolt (September 2016). "The Impact of Curation Algorithms on Social Network Content Quality and Structure". Working Papers.
  3. ^ a b c d e f g h i j k l m n Gausen, Anna; Luk, Wayne; Guo, Ce (December 28, 2022). "Using Agent-Based Modelling to Evaluate the Impact of Algorithmic Curation on Social Media". J. Data and Information Quality. 15 (1): 2:1–2:24. doi:10.1145/3546915. ISSN 1936-1955.
  4. ^ a b c d e f g h i j k l m n o p q r Bandy, Jack; Diakopoulos, Nicholas (April 22, 2021). "More Accounts, Fewer Links: How Algorithmic Curation Impacts Media Exposure in Twitter Timelines". Proc. ACM Hum.-Comput. Interact. 5 (CSCW1): 78:1–78:28. doi:10.1145/3449152.
  5. ^ a b c d e f g Herlocker, Jonathan. "Explaining Collaborative Filtering Recommendations".
  6. ^ a b c d e f g h i j "Online Recommender Systems – How Does a Website Know What I Want? |". Retrieved November 21, 2025.
  7. ^ a b c d Wang, Donghui; Liang, Yanchun; Xu, Dong; Feng, Xiaoyue; Guan, Renchu (October 1, 2018). "A content-based recommender system for computer science publications". Knowledge-Based Systems. 157: 1–9. doi:10.1016/j.knosys.2018.05.001. ISSN 0950-7051.
  8. ^ a b c Roy, Deepjyoti; Dutta, Mala (May 3, 2022). "A systematic review and research perspective on recommender systems". Journal of Big Data. 9 (1): 59. doi:10.1186/s40537-022-00592-5. ISSN 2196-1115.
  9. ^ Dou, Zhicheng; Song, Ruihua; Wen, Ji-Rong (May 8, 2007). "A large-scale evaluation and analysis of personalized search strategies". Proceedings of the 16th international conference on World Wide Web. Banff Alberta Canada: ACM: 581–590. doi:10.1145/1242572.1242651. ISBN 978-1-59593-654-7.
  10. ^ Liu, Yuhan; Song, Emmy; Zhang, Owen Xingjian; Merriman, Jewel; Zhang, Lei; Monroy-Hernández, Andrés (October 16, 2025). "Understanding Decentralized Social Feed Curation on Mastodon". Proc. ACM Hum.-Comput. Interact. 9 (7): CSCW507:1–CSCW507:25. doi:10.1145/3757688.
  11. ^ Quelle, Dorian; Bovet, Alexandre (February 26, 2025). "Bluesky: Network topology, polarization, and algorithmic curation". PLOS ONE. 20 (2) e0318034. doi:10.1371/journal.pone.0318034. ISSN 1932-6203.
  12. ^ a b c d Papa, Venetia; Photiadis, Thomas (December 15, 2021). "Algorithmic Curation and Users' Civic Attitudes: A Study on Facebook News Feed Results". Information. 12 (12): 522. doi:10.3390/info12120522. ISSN 2078-2489.
  13. ^ a b Lischka, Juliane A; Garz, Marcel (August 1, 2023). "Clickbait news and algorithmic curation: A game theory framework of the relation between journalism, users, and platforms". New Media & Society. 25 (8): 2073–2094. doi:10.1177/14614448211027174. ISSN 1461-4448.
  14. ^ a b c d Berman, Ron; Katona, Zsolt (2017–2020). "Curation Algorithms and Filter Bubbles in Social Networks". Marketing Science. 39 (2): 296–316. doi:10.1287/mksc.2019.1208. ISSN 0732-2399.
  15. ^ Freeman, Sophie; Gibbs, Martin; Nansen, Bjorn (April 19, 2023). "Personalised But Impersonal: Listeners' Experiences of Algorithmic Curation on Music Streaming Services". Association for Computing Machinery: 1–14. doi:10.1145/3544548.3581492.
  16. ^ a b Bandy, Jack; Diakopoulos, Nicholas (April 22, 2021). "More Accounts, Fewer Links: How Algorithmic Curation Impacts Media Exposure in Twitter Timelines". Proc. ACM Hum.-Comput. Interact. 5 (CSCW1): 78:1–78:28. doi:10.1145/3449152.
  17. ^ a b c d e f Lazer, David; Swire-Thompson, Briony; Wilson, Christo (September 1, 2024). "A Normative Framework for Assessing the Information Curation Algorithms of the Internet". Perspectives on Psychological Science. 19 (5): 749–757. doi:10.1177/17456916231186779. ISSN 1745-6916.
  18. ^ a b Villermet, Quentin; Poiroux, Jérémie; Moussallam, Manuel; Louail, Thomas; Roth, Camille (September 13, 2021). "Follow the guides: disentangling human and algorithmic curation in online music consumption". Proceedings of the 15th ACM Conference on Recommender Systems. RecSys '21. New York, NY, USA: Association for Computing Machinery: 380–389. doi:10.1145/3460231.3474269. ISBN 978-1-4503-8458-2.