1 Introduction
Social media feeds shape our everyday online interactions with others. Their interface designs and affordances define boundaries on how we can engage with people and content, while their algorithms dictate both what and who we engage with in the first place. Taken together, they shape our collective behaviors and social norms, cementing their role as social architects in the digital age.
The rise of algorithmic content curation and distribution via feeds happened alongside a key shift in the ownership structure of social networks—in the past 20 years, social media platforms transitioned away from being hosted by distributed, independent servers to operating under a small number of private corporations reaching millions of people worldwide [
22]. The funneling of capacities for algorithmic curation into the hands of a select few results in what Reviglio et al. call a lack of “algorithmic sovereignty” [
91]. It is by this process that the complexity and richness of our social realities have been distilled into a small number of homogenized parameters. In the face of seemingly ubiquitous promises of in-feed personalization, this has brought about a “personalization paradox” [
99].
Increasingly centralized curation can have significant negative consequences for users’ agency [
6,
45,
54,
67,
85,
93]. Platforms today often employ a top-down, one-size-fits-all approach when it comes to platform design and governance. In doing so, they marginalize those who fail to conform to the platform-wide majority. For example, user groups with culturally significant patterns of language use may have their content mislabeled and “downranked” by platforms [
45,
93], while neurodiverse users may find many posts too overwhelming to consume [
85]. Even users who fit within the majority may get frustrated from being shown irrelevant content with no efficient way to set controls that would eliminate such content from their feeds.
Prior work has documented users’ attempts to reclaim their agency by deriving algorithmic folk theories to probe black-box feed curation algorithms [
23,
30,
31,
51,
63,
98] and “teaching” these algorithms to better align with their preferences through strategic in-feed interactions [
30,
55]. The efficacy of these ad-hoc techniques, however, is often unclear, and using them can even leave users with undesirable feelings of coercion and manipulation [
14]. Why might this be? The problem is unlikely to be rooted in the
quantity of feedback that the user provides to the algorithm—after all, modern recommender systems leverage a wide variety of both implicit (e.g., content dwell time, mouse movements) and explicit (e.g., likes, blocks) feedback elicitation techniques [
57,
73,
78,
104], sometimes learning user preferences to a startling degree of accuracy [
99]. Instead, we posit that this is due to users’ lack of opportunity to
agentially articulate pertinent feedback to the algorithm, and have the algorithm respond accordingly based on their feedback.
Given this, we draw upon literature in interactive machine teaching (IMT) [
79,
90,
111] to chart out avenues for enabling
teachable feed experiences1 on social media. IMT proposes an interaction framework by which a user (the human teacher) without expertise in machine learning can train a model (the algorithmic learner) to accomplish desired tasks with limited amounts of pre-labelled data. Applications of IMT in social media settings, however, has been limited. To extend IMT’s framework to social media, we conducted a think-aloud study (
N = 24) with users from four feed-based social media platforms with diverse cultures and affordances—Instagram, Mastodon, TikTok, and Twitter
2—to answer the following question:
What are prominent signals relied upon by users to judge the value of content in their feeds, and are thus amenable to teaching to an algorithmic learner?
We define a “signal” as a pair of one feature (a category of information that can be extracted from a post, such as the
author or
included hashtags) and one characteristic (a statement describing the significance of the feature, such as
“is part of a recent fashion trend I’ve been following” for the feature
included hashtags)
3. We define “value” broadly based on the
desired order of consumption: Post A has a higher value than Post B if the user prefers to see A before B in their feed.
We find from our study that users leveraged a variety of signals to evaluate content from their feeds. These evaluations are nuanced in ways that current preference elicitation methods, such as “likes” or content dwell time, may fail to capture. Many users also expressed desires for feed experiences centered around individuals they cared about rather than content-based recommendations, better management of saved content, and more agency—in particular self-causality—when curating their feeds. Supplementing our findings with prior work on IMT, we offer five IMT-inspired design principles for teachable feeds. Finally, we embody these principles into three proposed feed designs that serve as sensitizing concepts to catalyze future research in this area.
Concretely, our paper offers the following contributions:
(1)
Cross-platform taxonomies of prominent signals used to determine the value of posts in social media feeds, enriched by themes extracted from user interviews.
(2)
Five principles to guide the design of teachable social media feed experiences.
(3)
Three proposed feed designs that illustrate our principles and serve as sensitizing concepts for teachable social media feed experiences going forward.
These contributions pave a path to equipping today’s social media systems with novel design patterns to empower agential, personalized feed curation.
5 Design Principles for Teachable Feed Experiences
A primary objective of IMT is to leverage an end user’s subject-matter expertise to train a learnable agent that can effectively aid the user in fulfilling their desired goals [
79,
90,
119]. However, teaching languages
10 proposed in prior work, such as PICL [
90] and Pearl [
48], may not translate smoothly to social media settings—they demand high-effort, meticulous interactions from users to provide meaningful labels and descriptions from data examples. Given that social media is designed to support fast-paced consumption of information, actively introducing excessive friction via conventional IMT interfaces may worsen the user experience rather than improve it.
Based on our taxonomy and participant interviews, along with prior literature in IMT, we outline five design principles to set the stage for teachable social media feed experiences. We do so to extend the core principles of IMT to the modern social media landscape. In some cases, this extension directly builds off of classic IMT principles; in others, those principles are re-examined and reconfigured. Our principles may be used independently or (ideally) in combination.
D1: Situate the teaching language within the feed. In IMT, teaching has conventionally been performed in isolation from the environment in which the teaching materials originated. For example, when creating a recipe classifier in PICL [
90], the user first imports a set of documents containing recipes before they can start teaching. Note that PICL is a separate environment from where users may encounter recipes in-the-wild, such as on websites. A separate teaching environment can enable more feature-rich and expressive teaching languages, but it can also disengage users from the act of teaching in social media settings. For one, the additional effort required to move posts and organize them outside of the feed is already seen as burdensome by our participants (see Section
4.2.3). Additionally, it may not be in platforms’ best competitive interest to support exporting a post and any informative metadata off the platform. We therefore situate the teaching language
within the feed itself. This way, we can directly leverage platforms’ existing representations and users’ existing mental models, while lowering the effort required to perform teaching. A key question then arises: what exactly do users’ mental models of current platform representations look like in the context of feed curation? Our taxonomies shed valuable light on this; we can leverage salient features identified in our taxonomies to inform the design of our in-feed teaching language.
D2: Be available, but not intrusive. We heard from many participants that social media can act as a much-needed break or distraction in the middle of the day, a means of relaxation, and a casual time-killer when small pockets of idle time arise. That is, participants were satisfied simply by letting the algorithm entertain them. In these scenarios (described by P15[TW] as
“low focus parsing mode”), persistently demanding extra attention and effort via IMT may in fact worsen the user experience and force users to expend more energy than they desire. We saw in practice that many participants switched between this low-focus mode and a more attentive, information-seeking mode (see Section
4.2.1), and that this switching was largely dependent on difficult-to-predict factors such as mood and context. In light of this, we ensure that our teaching language is available, but unobtrusive. That is, the interface is easily accessible to users across the entire feed experience, but can also be easily dismissed or hidden when not needed.
D3: Embrace a multiplicity of feeds. Some of our participants questioned why so many feed experiences had to be constrained to a singular feed and expressed a desire for multiple feeds to better organize their content. P12[IG] touched on the oddness of having diverse content mix in their feed:
“it’s weird to go past someone’s bikini photo and then, ‘5 people killed at a refugee camp.”’ Indeed, the range of characteristics used to describe features of posts as captured in our taxonomy corroborates this diversity. As a less extreme example, it may be jarring for users to see content that “contains an existing professional interest” if they expect to scroll through posts whose content “is funny.” P16[MA] mentioned that they would like to create different feeds for the various Mastodon instances they were on. P8[IG] likened their feed to a newspaper and suggested different sections:
“news, updates, events, pop culture, etc.” In fact, platforms have also started to explore multi-feed experiences and broadening of algorithmic choice. Many, including Twitter, now offer an engagement-based “For You” feed alongside a “Following” feed featuring content from followed accounts. One smaller social platform called Bluesky has introduced Custom Feeds, where developers can create feed algorithms to which users can browse and subscribe [
9]. TikTok has also introduced topic feeds based on inferred user interests [
71]. We consider this shift towards feed multiplicity a promising direction, especially when providing users with a means of organizing their IMT curriculum.
D4: Seek structured and unstructured feedback. IMT workflows have traditionally been scaffolded with affordances that elicit structured feedback (e.g., data highlighting and labelling). Our findings in Section
4.2.4 and the range of both positive and negative feature-characteristic pairs in the taxonomy suggest the need for teachable algorithms that accept both structured and unstructured feedback in order to capture users’ nuanced evaluation of social media posts, which corroborates existing research in IMT. For example, Jörke et al. [
48] provided a form-like interface for specifying key teaching concepts in which some fields allow the selection of existing data tags while others allow the user to write freeform natural language. Likewise, Zhou et al. [
119] noted the importance of relying on users’ unstructured object show-and-tell gestures in addition to a structured UI. This combination of structured and unstructured feedback is desirable in real-world applications as users may make nuanced judgements that cannot be captured with structured feedback alone, but also require some guidance to express concepts in a format parsable by an algorithmic learner. Parsing unstructured input, however, is now significantly less challenging due to technological advancements in large language models. We take this into consideration when balancing structured and unstructured feedback.
D5: Enable teaching and evaluation at varying timescales. Bennett et al. [
7] identified timescales of interaction—ranging from
micro-interactions (a few seconds or less) to
episodes (seconds to hours) to
life (days to years)—to be a key aspect of human agency and autonomy. When asked to articulate their preferences with limited access to their feeds during the study, many participants had difficulty doing so because their preferences
evolved over time based on the content they saw. As such, they expressed a desire to agentially refine their feeds’ behavior over an extended time period. This also happened to be less cognitively demanding, as P6[TW] pointed out:
“I don’t have to intentionally be like, okay, I’m gonna sit down and coordinate everything.” Our participants also described how their preferences shifted temporarily based on factors like mood (see Section
4.2.4). Indeed, we can see the pitfalls of assuming static preferences and designing only for interactions at short timescales via the inefficacy of set-and-forget personal content moderation tools [
44]. That said, classic debates in HCI over direct manipulation interfaces (UIs providing immediate control feedback through elements such as buttons and sliders) versus interface agents (systems that perform actions on behalf of users, often after learning user preferences over time) reveal that aspects of both are vital in information-dense environments [
96].We thus explore designs that enable teaching and evaluation at varying timescales. Specifically, we aim to leverage teaching interactions afforded through direct manipulation as a familiar interaction pattern to ground evaluations of algorithm performance over longer timescales. In doing so, we situate the teaching language as not only a tool for the teacher to articulate key concepts, but also one that aids evaluation of the algorithmic learner.
6 Proposed Feed Designs
We now propose three feed designs for teachable social media feed experiences that embody our findings and design principles. Our goal is not to constrain the design space of teachable feeds with these designs, but rather present them as sensitizing concepts—emergent ideas that help direct attention to promising topics or phenomena [
10,
120]—to illuminate salient paths for future research.
Note that the mockups illustrating our feed designs feature a generic microblogging platform, similar to Twitter or Mastodon. Our reasons for making this decision were twofold. First, we wanted to show how our designs can operate on platforms with a diverse range of feature and characteristic preferences, as opposed to ones like TikTok where a particular media modality (and therefore certain sets of features and characteristics) is significantly prioritized over others. Second, we tap into the increasing design and development interest in microblogging alternatives since Twitter’s change in ownership [
47,
97]. Many platforms in this space, including Mastodon and Bluesky, are also open source, making experimentation with novel ideas more accessible than on closed platforms.
6.1 Exploded UI Views
In 3D diagramming, an exploded view is one where the individual components of an object are shown slightly separated from each other as if a small explosion occurred at the center of the object. Exploded diagrams depict inter-component relationships and are thus commonly found in instructional manuals. We employ a similar concept in a post’s in-feed UI to serve as a teaching language for the elicitation of more granular preferences. On current platforms, “liking” a post can signal that the user enjoys some feature(s) in the post, but it does not clearly communicate what specific feature(s) they found like-worthy. An exploded UI view aims to recover this information by allowing the user to specify features within the post that they find (un)appealing.
Once a user expresses a generic signal (e.g., like, reshare) on a post, the UI explodes out into individual features, such as the author, text descriptions, attached media, and hashtags (all features in our taxonomies—see Section
4.1). These features may be directly available in the post or algorithmically extracted—indeed,
topic was a popular feature in our content-based taxonomy that needed to be inferred from posts (Fig.
5). In Fig.
6, we distinguish inferred features from extracted ones by rounding out their UI. The user can then engage in teaching the feed algorithm their preferences in this exploded view by selecting the features that they consider (ir)relevant. While we could further elicit detailed characteristics associated with those features, just like we did in our study’s Miro board, we reduced this down to simple plus and minus buttons (indicating a positive or negative characteristic, respectively) to avoid overburdening the user.
We ensure that “explosion” happens within the feed so that teaching can be done without leaving the feed, satisfying
D1. Additionally, since the exploded view can occupy more space than the normal post UI, especially when there are numerous features available to teach with, we provide a convenient option for the user to collapse back the exploded view (see Fig.
6 (B)). The user can also simply scroll away. This simple dismissal of the exploded UI aligns with
D2. Finally, even as direct manipulation interfaces affording immediate interaction, exploded UIs allow users to gradually accumulate preferences informed by what they view, satisfying
D5.
6.2 Multi-Feed Curriculum Organization and Seeding
A teaching language alone cannot close the teaching loop. Here, we propose a design in which assembling the teaching curriculum and evaluating the learner’s performance is closely integrated with the teaching language via feed multiplicity. Key to this design is the use of curriculum organization to curate a multi-feed experience.
As a user expresses preferences using a teaching language, those preferences are saved into curriculum “folders.” Due to the differing nature of account- and content-based preferences, we separate the two in Fig.
7. A folder can then spawn a new feed that aims to provide more focused content that adheres to the folder’s theme. The post on which the user first expressed preferences becomes a “seed” for the new feed to guide the recommendation of related content. This multi-feed experience serves as a way for users to evaluate the algorithmic learner’s performance—a relevant and well-curated feed is a sign that the learner is effectively acting on taught preferences. Otherwise, the user can provide feedback to the learner through the in-feed teaching language, closing the teaching loop.
If a user does not want a folder to form a feed, they can toggle it off in the curriculum; folders created from negative feedback are toggled off by default. Furthermore, as participants pointed out in Section
4.2.3, lightweight algorithmic interventions, such as suggesting folders for unorganized areas of the curriculum (see the “Unsorted” folder’s Sort For Me option in Fig.
7) can also aid with curriculum organization.
Given that user trust hinges on observed learner performance [
76,
111], additional support may be added to further facilitate learner evaluation. One possible approach is to reuse already-familiar interaction patterns in the teaching language scaffold evaluation. Fig.
8 offers an “Explain” option for posts recommended by the learner in a feed formed from a folder. Selecting that option would expand the post into an exploded UI with pre-selected preferences that the learner infers from existing ones. The titles of features then become explanations that link back to folders and other curriculum material used by the learner to infer that preference. With limited work in explanation and evaluation techniques in IMT [
107], employing the teaching language as an aid for evaluation suggests one approach to mend this gap.
This design’s use of feed multiplicity embodies
D3. Additionally, unlike the set-and-forget approach to creating custom feeds on Bluesky [
9], this design enables users to iteratively build, refine, and evaluate their feeds in the spirit of
D5. This design can also be easily dismissed (
D2): users can toggle feeds off from within the curriculum and the curriculum itself is located in another tab separate from regular feed activities. However, if the user does choose to engage with this design, the proximity to and reliance on an in-feed teaching language satisfies
D1.
6.3 Purposefully Finite Feeds with Natural Language Feedback
The infinite scroll is a dominant design pattern in contemporary social media. Implementation-wise, this effect is achieved by loading posts in batches such that another batch of content is quickly available once the user reaches the end of the previous batch. What if we can repurpose this transition between batches into an opportunity for preference elicitation and reflection?
In this feed design, we explore what it means for feeds to be purposefully finite. We split a feed into individual “stacks” of content; users can set the size (number of posts) of each stack. When the user reaches the end of a stack, they are presented with a teaching language in the form of a text input area in which they can specify, in natural language, any preferences to incorporate into future stacks. The combination of finite stacks and natural language feedback addresses two insights participants raised in our study. First, preferences may not be well-formed before consuming content—by allowing users to first view content and then reflect on what they viewed, users may be able to articulate their preferences with more precision. Second, the feed was used to both browse and search for content. An adjustable stack size allows a user to smoothly transition between the two consumption modes. With a low stack size and frequent feedback input, the feedback starts to resemble intent-driven search queries, while a high stack size brings the experience closer to that of a conventional infinitely scrolling feed. On top of all this, unstructured natural language allows users to specify nuances that may be difficult to communicate through structured means, such as UI buttons.
The accumulation of natural language feedback also presents novel interaction opportunities. Users may use consecutive pieces of feedback to assemble a chain of preferences that can help them discover content with specific features and characteristics. Users can then exit from these more focused views by deleting preferences from their chain, or removing the entire chain to start afresh. Users’ preferences can also be automatically summarized by the algorithmic learner into “Observations” (see Fig.
9) are shown to the user for additional reflection. These observations can be edited by the user to guide future recommendations, similar to editable natural language user profiles in modern conversational recommender systems [
36].
This natural language feedback, when used in combination with the more structured feedback presented in Section
6.1, results in an expressive set of teaching languages (
D4). Like exploded UI views, it also operates in-feed, per
D1. If the user does not wish to engage in this design, they may simply proceed to the next stack without providing feedback, or revert to an infinitely scrolling feed by setting the stack size to infinity, satisfying
D2. Finally, by first accumulating feedback over time as they view content, users can refine system behavior over longer timescales, while still being afforded the ability to directly and immediately update the system’s learned knowledge as preferences evolve (
D5).
8 Conclusion
An architect may design a home, but it is ultimately up to the home’s occupants to decorate the space to reflect their unique tastes and preferences. Today, social media feeds act as digital architects of personalized social spaces, but users often lack such decorative agency.
In this work, we explored the idea of teachable social media feed experiences for agential, personalized feed curation. To do so, we conducted a study with 24 social media users across Instagram, Mastodon, TikTok, and Twitter to elicit key signals they used to determine the value of posts in their feeds. We found that users evaluated content in multi-faceted and nuanced ways that cannot be fully captured by affordances on current platforms. To enable users to better “teach” an algorithm their preferences, we offered five IMT-inspired five design principles for teachable feed experiences, informed by findings from our study. We then embodied these principles in three feed designs to inspire future efforts on integrating teachable feeds into real-world social media systems.
Altogether, our contributions lay the groundwork for continued exploration of teachable feed experiences. We hope this exploration can empower users and platforms to craft more comforting and expressive digital homes.