Big Picture 9 min read

When Coworkers Become a Team (and When They Do Not)

Most groups of coworkers never cross the threshold to a real team. Five conditions decide whether your group becomes one, and the work to set them up has to happen before anyone enters the room.

By Asa Goldstein, QuestWorks

TL;DR

Coworkers do not become a team by sharing an org chart, a manager, or a Slack channel. Research on real teams from Hackman, Katzenbach, Wageman, and Edmondson points to five threshold conditions: shared goal, mutual dependency, interdependent work, shared identity, and repeated reps under pressure. Hybrid work strips out the informal interaction where most of those conditions used to form. Managers who design the threshold deliberately get teams; managers who hope for it get coworkers in a meeting.

Ask the eight people who report into the same VP whether they are on the same team. You will hear eight different answers.

Ruth Wageman, Debra Nunes, James Burruss, and Richard Hackman studied this directly. In their work on more than 120 senior leadership teams, "almost every team said it had set unambiguous boundaries on membership, yet fewer than 10% agreed about who was on it". The group thought it was a team. The members did not even agree about who counted.

That is the gap. The org chart says team. The work says group. Most of the time, the work is right.

What Separates a Group From a Team

Jon Katzenbach and Douglas Smith published the cleanest definition in 1993 in The Discipline of Teams. They put workplace collectives on a continuum: working group, pseudo-team, potential team, real team, high-performance team. A working group is a set of people who share information and make individual decisions; their performance is the sum of what each member does alone. A real team is small, has complementary skills, shares a goal, depends on each other to reach it, and holds itself jointly accountable for the result. Most groups stop at working group. Most do not even know there is anything further along.

The Ringelmann effect, first documented in 1913 and replicated repeatedly since, is the structural reason groups underperform what their members could deliver. Per-person pulling force on a rope drops to roughly half by a group of eight. Alan Ingham's 1974 replication isolated the cause: motivation loss, not coordination loss. People in groups give less because they can. The org chart cannot fix that. Only structural interdependence can.

Richard Hackman's "real team test" gives the structural answer. In his framework, a real team is bounded (you know who is in it), stable (membership does not churn weekly), interdependent (the work requires the others), and operates inside clear authority. Skip any of those and you have a group that meets, not a team that performs. Most teams skip three.

The Five Threshold Conditions

Cross-referencing the team-effectiveness literature, the threshold reduces to five conditions. A group that has them is a team. A group missing any of them is a group.

1. A Shared Goal

Hackman's first condition is a compelling direction. Vague missions and quarterly OKR lists do not count. A real shared goal is a single objective the team holds in common and could state in one sentence. The Team Diagnostic Survey research validated by Wageman and colleagues across 2,474 members and 321 teams found that the six conditions, with direction first among them, account for up to 80% of the variance in team effectiveness. Most direction-setting falls apart the first time the team negotiates priorities. If three teammates are each optimizing for three different outcomes, the group cannot be a team. It can only be coworkers running on parallel tracks.

2. Mutual Dependency

The Navy SEAL trust matrix is the cleanest example of this condition in extremis. SEAL teams select for trust over raw performance because the work is structurally life-and-death interdependent. High performers with low trust are rejected. The selection logic is mutual dependency: if I cannot count on you to cover my flank, your skill is irrelevant to me. Knowledge work rarely reaches that intensity, but the principle holds. If the group could deliver the outcome with any teammate swapped out and nothing meaningful would change, the team is not actually a team. It is a roster.

Spotify's well-documented Squads model is the cautionary version. The squads were autonomous and, on paper, interdependent. When alignment weakened, Jeremiah Lee's internal retrospective "Failed Squad Goals" describes how the codebase fragmented and squads optimized locally. Autonomy without dependency produces silos with team names on them.

3. Interdependent Work

Dependency is the disposition; interdependence is the work. Mathieu and colleagues' 2000 Journal of Applied Psychology study showed shared mental models, the overlap in how teammates represent the work and each other's roles, predict both team process and performance. Teams that share a model coordinate without talking about coordinating. Teams that do not share a model spend most of their meeting time clarifying what they already think they agreed about last week.

Alex Pentland's MIT research on communication patterns in real workplaces, summarized in The New Science of Building Great Teams, found that observable communication patterns predicted team success as much as all other variables combined. The healthy pattern: roughly equal turn-taking, around twelve exchanges per hour, energy and engagement outside formal meetings as much as inside them. In one Bank of America call center study, the energy of informal interaction explained roughly a third of dollar productivity variance. Interdependent work generates that pattern. Independent work does not.

4. Shared Identity

People do not commit to a depersonalized group. They commit to a we. Pixar's Braintrust is the cleanest example of identity-by-practice. Ed Catmull's HBR write-up of how Pixar fosters collective creativity describes the Braintrust as a recurring peer-review group with no formal authority and one rule: candor about the work. The identity is forged in the practice of giving each other hard feedback on unfinished work and surviving it. Catmull called it the antidote to a "stack of brilliant soloists."

Identity is also the social glue that lets the rest of the conditions hold under pressure. When a team has one, it can absorb a disagreement without rupture; when it does not, every conflict re-litigates whether the team should exist.

5. Repetition Under Pressure

Bruce Tuckman's 1965 forming-storming-norming-performing model is taught in every management course and is methodologically thin. The model was based on a literature review, not field data, and later researchers, including Connie Gersick's punctuated-equilibrium work, found teams do not actually move through neat sequential stages. What does hold is the underlying claim that teams need repeated reps to become teams. You cannot norm without something to norm around.

Hackman and Wageman's 60-30-10 rule is the most useful version of this. Of the variance a coach can influence, roughly 60% is set in pre-launch design (membership, purpose, structure), 30% in the launch itself, and only about 10% in real-time coaching once the team is running. Most managers invert that ratio, hoping to coach their way out of a launch they never designed. The rule says you cannot. The reps work only when the conditions were set before the team met.

Why Hybrid Breaks the Threshold

The threshold conditions used to form by accident. People walked past each other's desks, overheard problems, and rebuilt shared mental models continuously. Hybrid stripped that out and replaced it with scheduled calls.

Microsoft's 2022 Work Trend Index documented the gap. 85% of leaders said the shift to hybrid had made it harder to have confidence in employee productivity. Only 12% had full confidence their team was productive, against 87% of employees who said they were. 49% of hybrid workers said they struggled to trust their teammates, versus 36% of fully in-person workers. Visibility into team activity dropped from 54% to 38%.

Amy Edmondson and Mark Mortensen extended this to the threshold question directly in a 2021 HBR piece. Hybrid does not just lower trust averages. It distributes the loss asymmetrically. Remote teammates pay a higher candor tax than in-person teammates, which means the very signals the team needs to develop shared identity travel less freely the more distributed the team is.

Gallup's 2024 State of the Global Workplace places the cost. Managers account for 70% of the variance in team engagement. Global engagement sits at 21%, with an estimated $438B in lost productivity attached. The threshold conditions are the levers under most of that variance. When they are missing, you do not get a worse team. You get coworkers.

What Managers Should Do

The instinct is to schedule a team-building day, do some name games, and call it formed. The meta-analytic evidence says that does not work. Eduardo Salas's 2016 PLOS ONE meta-analysis on team training found team training shows a medium effect on performance overall, with role-clarification interventions reliably moving outcomes while interpersonal-relations interventions do not. When team-building tells you who does what and how the work fits, it works. When it tells you to share a fun fact about yourself, it does not.

The action list is straightforward and front-loaded, in line with the 60-30-10 rule:

  1. Bound the team. Name every member in writing. If you cannot list them in one minute, the team is unbounded. Fix that before anything else.
  2. State the goal in one sentence. Test it by asking every member to write the team's purpose privately. If their sentences do not match, you do not have a shared goal yet, you have a shared meeting.
  3. Audit the interdependence. If two teammates can deliver their work without ever needing each other, either change the work or accept that they are colleagues, not teammates.
  4. Design the reps. Schedule a recurring, low-stakes situation where the team has to decide together. Not status, not show-and-tell. A decision they own jointly.
  5. Pre-launch first, coach second. Spend disproportionate time before the team's first real task. After it starts, your leverage drops fast.

For new teams crossing the threshold for the first time, the order in which you do these matters as much as that you do them. We mapped a week-by-week version of this in The First 90 Days of a New Team and a structured menu of formative reps in Team Building Activities for New Teams. For teams already running, the diagnostic is what work actually requires the others. We unpacked that in Shared Fate, and the broader pattern of what high-performing teams share is collected in What Makes a High-Performing Team.

The Counterpoint: Real Teams Are Demanding, Not Comfortable

Psychological safety is the single most-cited concept in modern team research and the single most misused. Edmondson herself, in a May 2025 HBR piece, named the failure mode directly: psychological safety has become a cushion for mediocrity in many teams. Safety to take interpersonal risk was never supposed to mean safety from accountability for the work.

The Project Aristotle research from Google made psychological safety famous as the top of five team dynamics. The research is widely cited and never peer-reviewed, and replication standards in social psychology have tightened considerably since it was published. Treat it as one data point among many, not a settled finding. The harder counter holds either way: real teams are demanding precisely because the members care about the outcome and about each other. Comfort is a side effect of high standards being held, not a substitute for them. Polite teams are not safe; they are stalled.

Anita Woolley and colleagues' 2010 Science paper on collective intelligence in 699 individuals across 192 groups is consistent with this. The team's "c-factor" was not correlated with average or maximum IQ. It was correlated with social sensitivity and equal turn-taking. A group of highly capable individuals with no equal participation underperforms a group of average individuals with balanced contribution. The threshold is not built from talent. It is built from the conditions that let the talent operate as a unit.

Where QuestWorks Fits

Most of the conditions above are not knowable from a survey. They show up in behavior under mild pressure: who steps up, who defers, who pools unique information, who routes around the group. The reason hybrid teams stay stuck below the threshold is that their reps under pressure mostly happen during real work, where the stakes are high and the social cost of experimenting is higher.

QuestWorks runs on its own cinematic, voice-controlled platform. Each week, teams of 2-5 take 25 minutes to make decisions together inside AI-facilitated quests. The format supplies the repetition condition directly: a recurring, interdependent, identity-forming rep where the team makes joint decisions in a low-stakes environment. HeroGPT coaching, available in Slack, gives each player private feedback the team never sees. The Weekly Team Health Report gives leaders aggregate signal on participation patterns and decision velocity. Nine public HeroTypes give the team a shared language for who tends to step up where, the way Pixar's Braintrust gave its members a shared language for what kind of feedback they were getting.

Pricing is $14 per user per month for the first 50 companies (Founder's Circle, locked forever) or $20 per user per month standard, with a 10-day free trial. Participation is voluntary. Start a free trial and run weekly quests with one of your teams. After a few reps, the behavior under pressure will tell you whether you have a team or a roster.

Frequently Asked Questions

A group is a set of coworkers who share an org chart, a manager, or a meeting cadence. A team shares a goal, depends on each other to reach it, holds joint accountability for the outcome, and has a stable, bounded membership. Katzenbach and Smith framed it as a continuum from working group to real team to high-performance team. Most coworkers stop at working group and never cross.

Hackman's research points to five thresholds: a shared compelling goal, mutual dependency on each other, work that is structurally interdependent, a shared identity, and repeated reps together. Wageman and Hackman's data on senior leadership teams found that fewer than 10% of teams agreed on who was even on the team, which means most groups fail at the first condition before they reach the rest.

Hybrid removes the unscripted reps where teams used to form. Microsoft's 2022 Work Trend Index found 85% of leaders had reduced confidence in employee productivity and 49% of hybrid workers struggled to trust their teammates. Without informal interaction, identity does not form, shared mental models do not develop, and the group stays a group of coworkers.

Hackman and Wageman's 60-30-10 rule puts 60% of team effectiveness in pre-launch design: clear membership, clear purpose, real interdependence, and enabling structure. Salas's meta-analyses show role-clarification interventions reliably move performance while interpersonal-feelings interventions do not. Design the conditions first, then add repeated reps where the team makes decisions together.

QuestWorks runs 25-minute weekly quests where 2-5 teammates make decisions together on a cinematic, voice-controlled platform. The format supplies the repeated, interdependent, identity-forming reps that hybrid teams otherwise never get. HeroGPT coaching in Slack stays private, while the Weekly Team Health Report shows leaders aggregate signal on participation and decision velocity. Pricing is $14 per user per month for the first 50 companies or $20 per user per month standard, with a 10-day free trial. Participation is voluntary.

Ready to Level Up Your Team?

10-day free trial. Install in under a minute.

Slack Microsoft Teams Try it free
Team Intelligence™, powered by play.SlackMicrosoft Teams Try QuestWorks Free