Focus Groups & Qualitative

Video Focus Groups: Setup, Tools, and Best Practices

6 min read

How to set up and run video focus groups. Platform comparison, technical requirements, moderation tips, and recording best practices for researchers.

Video Focus Groups: Setup, Tools, and Best Practices

What Are Video Focus Groups?

Video focus groups are live, moderated qualitative discussions conducted through video conferencing platforms where participants and the moderator interact face-to-face via webcam. They're the most common format for online focus groups because they preserve visual cues (facial expressions, reactions, body language) while offering the cost and recruitment advantages of remote research.

The format works for most qualitative research that doesn't require physical product interaction: concept testing, messaging evaluation, customer experience exploration, brand perception studies, and UX feedback sessions. With the right setup, video groups produce data comparable to in-person sessions at roughly half the cost.

Choosing a Platform

Your platform choice depends on the features you need beyond basic video conferencing.

Platform Type Examples Cost Best For
General video Zoom, Microsoft Teams, Google Meet Free-$20/host/month Simple groups, budget-conscious teams, participants already familiar with the platform
Dedicated qual platforms Recollective, FocusGroupIt, Discuss.io $200-$1,000/session Built-in research tools (annotation, highlight clips, stimulus presentation), teams running frequent qual
Integrated research platforms Quali-Fi, Remesh Varies by tier Teams combining qual and quant in one project, AI-powered analysis needs

For most teams starting out, Zoom is sufficient. It's familiar to participants (reducing tech friction), supports breakout rooms for sub-group exercises, includes built-in recording and transcription, and handles screen sharing for stimulus presentation. The main limitation is that Zoom lacks research-specific features like participant annotation, automated highlight clipping, and integrated coding tools.

When to invest in dedicated platforms: If you're running more than 10 groups per quarter, need client-facing observation tools that are more polished than a muted Zoom participant, or want built-in analysis features that connect directly to your discussion data.

Technical Setup Requirements

For the Moderator

  • Reliable internet: Wired connection preferred. Minimum 10 Mbps upload/download. Test before every session.
  • Camera and lighting: External webcam at eye level. Natural or ring light facing you (not behind you). Participants respond better to a moderator they can see clearly.
  • Audio: External microphone or quality headset. Built-in laptop mics pick up room echo and keyboard noise. A $50 USB condenser mic dramatically improves audio quality.
  • Dual monitors (recommended): One screen for the video gallery, one for your moderator guide and note-taking. Alt-tabbing during a session is distracting and visible to participants.
  • Backup plan: Phone dial-in number ready, chat window open as a secondary communication channel, second device charged and logged in.

For Participants

Send setup instructions 48 hours before the session. Keep them short and specific:

  • Use a computer (not a phone or tablet) with a working camera and microphone
  • Join from a quiet, private space
  • Close other applications to preserve bandwidth
  • Test the platform link 24 hours before the session (provide a test link)
  • Have the moderator's phone number in case of technical issues

Technology check at session start: Build 5 minutes into the beginning of every session for audio/video confirmation. Ask each participant to say their name and confirm they can see the screen share. This catches problems before the discussion begins rather than 20 minutes in.

Moderation Best Practices for Video

Video introduces dynamics that don't exist in person. These practices address the most common challenges.

Managing Turn-Taking

On video, participants can't read the physical cues that signal someone is about to speak. The result: either awkward silences (everyone waits for someone else) or talking over each other. The moderator needs to actively manage this.

Call on participants by name: "Maria, what's your take on that?" Use a round-robin approach for important questions where you want everyone's input. For organic discussion, establish a hand-raise protocol (physical hand or platform's raise-hand feature) so participants can signal they want to speak without interrupting.

Keeping Engagement High

Video attention fades faster than in-person. Participants have email, Slack, and their phones within reach. Counter this with:

  • Shorter discussion blocks. 10-15 minutes per topic, then shift. Don't spend 30 minutes on one question.
  • Visual stimuli. Share images, concepts, or video clips every 10-15 minutes. Visual changes re-engage wandering attention.
  • Direct address. Use participants' names frequently. "James, I noticed you reacted when Sarah mentioned the packaging. Tell us about that."
  • Polling or typing exercises. Mid-session, ask everyone to type their one-word reaction in chat before discussing. This creates a moment of individual engagement and prevents anchoring.
  • Session length. Cap at 90 minutes. 60-75 minutes is better for most topics. Anything beyond 90 minutes on video will show in the quality of responses.

Sharing Stimuli Effectively

Screen share is the primary stimulus delivery method. A few rules keep it smooth:

  • Share the specific window, not your entire screen. This prevents accidental exposure of moderator notes, participant lists, or client messages.
  • Give silent viewing time. After sharing a concept, say "Take 30 seconds to look this over before we discuss." This ensures reactions aren't anchored by whoever speaks first.
  • Keep stimuli simple. Dense text or complex layouts don't read well on smaller screens. Design stimuli for screen viewing, not printout.
  • Prepare all materials before the session. Having stimuli open in tabs, ready to share, prevents the fumbling that breaks session flow.

Recording and Documentation

  • Record video and audio. Confirm recording consent at the session start (include it in your introduction script).
  • Enable auto-transcription. Most platforms offer this. Clean the transcript afterward for accuracy.
  • Assign a dedicated note-taker. The moderator should not take notes during the session. Moderating and note-taking simultaneously means doing both poorly. A second team member watches the session and captures observations, non-verbal reactions, and group dynamics that transcripts miss.
  • Back up the recording. Download immediately after the session. Cloud recordings can expire, and platform outages happen.

Handling Technical Problems

Technical issues are inevitable. Having a protocol prevents minor glitches from derailing the session.

Participant drops off: If one participant loses connection, continue the discussion. They can rejoin. If two or more drop, pause and troubleshoot. Have a phone number ready for each participant as a backup audio channel.

Audio echo or feedback: Ask the affected participant to use headphones or mute when not speaking. If the problem persists, switch them to phone dial-in for audio.

Screen share fails: Have stimuli backed up as a file you can drop in the chat. Participants can open it on their own screen while you troubleshoot the share.

Platform outage: This is rare but happens. Have a backup platform (e.g., Google Meet link) ready and send it to all participants via text message. Even a 5-minute delay feels manageable when you have a backup plan.

Common Mistakes

  1. Not testing before the session. Run a full technical rehearsal with your note-taker using the exact setup (platform, stimuli, recording settings) you'll use in the live session. Discovering that your recording settings are wrong during a live session is a problem with no good fix.

  2. Too many participants. Six people on video is a discussion. Eight or more becomes a webinar where half the group goes passive. Keep video groups at 4-6 participants. For optimal group sizing, see the dedicated guide.

  3. Ignoring the gallery view. The moderator should see all participants simultaneously (gallery view), not just the active speaker. Reactions from non-speakers are data. A participant rolling their eyes or nodding enthusiastically while someone else talks tells you something.

  4. Forgetting about observers. If clients or stakeholders are watching, brief them before the session: cameras off, mics muted, use the private chat to send the moderator questions rather than speaking. An unexpected observer voice mid-discussion breaks the dynamic.

How Quali-Fi Supports Video Focus Groups

Quali-Fi's Research tier includes built-in video conferencing designed for research sessions. Recording starts automatically with consent capture, transcription runs in real time, and the moderator can present stimuli directly from the project dashboard. Observers get a separate view with a private back-channel chat to the moderator.

After the session, recordings and transcripts feed directly into Quali-Fi's AI-powered analysis tools, so you can start coding themes within minutes of the session ending. The integrated workflow eliminates the file-export-import cycle that slows down teams using separate platforms for video and analysis.

Frequently Asked Questions

Do participants need to have their cameras on?

Yes, for video focus groups. The point of the format is to see facial reactions and expressions. State the camera-on requirement in recruitment materials and the session introduction. Participants who aren't willing to be on camera are better suited for asynchronous focus groups or surveys.

What's the best group size for video focus groups?

4-6 participants. Four is the minimum for meaningful group discussion. Six is the maximum before video conversations get unwieldy. Some moderators prefer 5 as the sweet spot. See the focus group size guide for detailed recommendations.

How long should a video focus group last?

60-90 minutes. 60 minutes works for single-topic discussions. 90 minutes is the maximum for multi-topic sessions with stimuli. Don't schedule beyond 90 minutes; participant attention and response quality decline sharply after that mark.


Run video focus groups with built-in recording and AI analysis -- try Quali-Fi free for 14 days.

Frequently Asked Questions

Related Guides

Put it into practice

Ready to apply this in your research?

Quali-Fi makes it easy to run surveys, conjoint studies, and more, all in one platform.