Skip to main content

Meta-Style Problems

Reading time: ~45 min | Interview relevance: Critical (Meta/Instagram/WhatsApp) | Roles: Machine Learning Engineer, Software Engineer (ML), Research Scientist, Data Scientist, ML Infrastructure Engineer at Meta

Meta interviews move fast. Where Google wants you to discuss three approaches before coding, Meta wants you to pick the right approach immediately and execute cleanly. Where Google gives you 45 minutes to explore a problem, Meta often packs two problems into 45 minutes.

Meta's ML interviews are deeply product-oriented. Every ML system design question ties back to a specific Meta product: News Feed ranking, Instagram Reels recommendation, Marketplace fraud detection, WhatsApp spam filtering. If you cannot connect your ML knowledge to product impact, you will struggle in Meta interviews.

This list of 30 problems is calibrated to Meta's actual interview style based on publicly reported experiences from 2023-2025. The emphasis is on speed, product ML, recommendation systems, and practical execution.

Meta Interview Structure (ML Roles)

RoundDurationEvaluation FocusMeta-Specific Notes
Coding 145 minDSA (2 problems)Two problems, not one. Speed is essential
Coding 245 minML coding or system codingImplementation of ML algorithms or data systems
ML System Design45 minEnd-to-end ML systemAlways tied to a Meta product
Behavioral45 minImpact, collaboration, growthMeta values: Move Fast, Be Bold, Focus on Long-Term Impact

:::tip What Makes Meta Interviews Unique

  1. Speed. Meta coding rounds typically include two problems in 45 minutes. You have ~20 minutes per problem.
  2. Product ML. Every system design ties to a real Meta product (Feed, Reels, Marketplace, Ads).
  3. Recommendation systems. Meta is fundamentally a recommendation company. Expect recommendation-focused questions.
  4. Execution over exploration. Meta values candidates who can identify the right approach quickly and execute.
  5. Impact stories. Behavioral rounds focus heavily on measurable business impact. :::

Section 1: Coding Problems (12 Problems)

Meta coding rounds are fast-paced. You will likely face two problems in 45 minutes. The first is usually medium, the second is medium-hard. Speed is critical.

Fast-Execution DSA

#ProblemDifficultyTimeKey PatternMeta LensWhy Meta Asks This
1LRU CacheMedium15 minHash map + DLLExecute fast, no bugsFeature caching in recommendation serving
2Binary Tree Right Side ViewMedium12 minBFS level-orderQuick traversal thinkingTree structures in content taxonomies
3Add and Search Word (Trie + DFS)Medium18 minTrie with wildcardData structure selectionSearch autocomplete, content matching
4Random Pick with WeightMedium15 minPrefix sum + binary searchProbability thinkingWeighted sampling for ad selection, A/B testing
5Minimum Window SubstringHard20 minSliding window + hash mapOptimal solution directlyText analysis, content matching
6Alien DictionaryHard22 minTopological sort from constraintsGraph modelingRanking order inference, dependency detection

ML-Adjacent Coding

#ProblemDifficultyTimeKey PatternMeta LensWhy Meta Asks This
7Implement Batch Gradient DescentMedium15 minVectorized computationNumPy fluencyCore training loop for any model
8Implement AUC-ROC from Sorted PredictionsMedium15 minThreshold sweepMetric implementationPrimary metric for ranking and classification models
9Implement Feature Hashing (Hashing Trick)Medium15 minHash function + modular indexingSparse feature encodingHigh-cardinality features in ad/content ranking
10Implement Online Learning Update (SGD with Decay)Medium15 minGradient update + learning rateReal-time model updatesOnline learning for fresh ranking signals
11Implement Stratified SamplingMedium12 minProportional selectionData handlingBalanced training data for imbalanced classes
12Implement Cosine Similarity for Sparse VectorsEasy10 minSparse dot productEfficient computationContent similarity, embedding comparison

:::warning Meta Coding Speed Benchmarks

  • Must solve a medium problem in 12-18 minutes (not 25-30)
  • Must solve a medium + hard pair in under 40 minutes
  • Must write bug-free code on the first pass (limited time for debugging)
  • Must handle edge cases without being prompted (empty input, single element, overflow)

If you consistently take >20 minutes for medium problems, focus on speed drills before your Meta interview. :::

Section 2: ML System Design (12 Problems)

Meta ML system design is always product-oriented. The interviewer will name a specific Meta product and ask you to design the ML system behind it. You must demonstrate product intuition alongside ML expertise.

Core Product ML

#ProblemDifficultyTimeProductKey ChallengeWhat Meta Evaluates
13Design News Feed RankingHard45 minFacebook FeedMulti-objective optimization (engagement, time well spent, integrity)Understanding of ranking at massive scale with competing objectives
14Design Instagram Reels RecommendationHard45 minInstagram ReelsCold-start for new content, short-video engagement modelingExploration-exploitation for creator ecosystem health
15Design People You May Know (PYMK)Medium35 minFacebook SocialGraph features + ML ranking, privacy constraintsCombining social graph structure with ML scoring
16Design Facebook Marketplace Item RankingMedium35 minMarketplaceLocation-aware ranking, seller quality, fraud detectionMulti-signal ranking with trust and safety
17Design Instagram Explore FeedMedium40 minInstagram ExploreInterest discovery, topic diversity, freshnessBalancing personalization with content discovery
18Design WhatsApp Spam DetectionMedium35 minWhatsAppEncrypted messages (no content access), behavioral signalsML without content features; metadata and behavioral modeling

Ads & Monetization ML

#ProblemDifficultyTimeProductKey ChallengeWhat Meta Evaluates
19Design Meta Ads Targeting SystemHard45 minMeta AdsUser interest modeling, privacy (no cross-app tracking post-ATT)Ads relevance without invasive tracking; privacy-preserving ML
20Design Ad Auction and Bid OptimizationHard45 minMeta AdsAuction mechanics, pacing, budget optimizationRevenue maximization with advertiser satisfaction constraints

Integrity & Safety ML

#ProblemDifficultyTimeProductKey ChallengeWhat Meta Evaluates
21Design a Hate Speech Detection SystemMedium40 minContent IntegrityMultilingual, context-dependent, adversarial robustnessPrecision vs. recall tradeoff for content moderation at scale
22Design a Fake Account Detection SystemMedium35 minAccount IntegrityBehavioral signals, graph analysis, evasion detectionNetwork-based features, temporal patterns, adversarial actors

ML Infrastructure

#ProblemDifficultyTimeProductKey ChallengeWhat Meta Evaluates
23Design a Real-Time Feature PipelineHard45 minML PlatformOnline feature computation, consistency with offlineFeature freshness vs. consistency; real-time aggregations
24Design a Model Serving System for 1M QPSHard45 minML PlatformUltra-low latency, high throughput, model versioningBatching, caching, model optimization (quantization, distillation)

:::note Meta ML Design Framework Meta interviewers expect this structure (move quickly through each):

  1. Problem Clarification (2 min): What product? What metric? What scale?
  2. Metrics (3 min): Business metric (revenue, engagement) + ML metric (AUC, NDCG) + Guardrail metric (integrity, time well spent)
  3. System Architecture (5 min): High-level pipeline (data -> features -> model -> serving)
  4. Feature Engineering (8 min): User features, item features, context features, interaction features
  5. Model Architecture (7 min): Embedding layers, interaction modeling, multi-task heads
  6. Training & Evaluation (5 min): Training data, labels, offline eval, online A/B test
  7. Serving (5 min): Latency, throughput, model refresh cadence
  8. Deep Dive (5 min): Interviewer picks one area; go deep

Meta interviewers value depth in feature engineering and model architecture. Spend extra time there. :::

Section 3: Product Sense & Applied ML (3 Problems)

Meta places high value on product intuition. These problems test whether you can connect ML decisions to product outcomes.

#ProblemDifficultyTimeTopicWhat Meta Evaluates
25Engagement metrics are up but user satisfaction is down. Diagnose and fix.Medium25 minMetric DesignUnderstanding of goodhart's law; proxy metric failure; time well spent vs. engagement
26Design an experiment to test a new ranking algorithm with potential integrity risksMedium25 minExperimentationHoldout groups, guardrail metrics, rollback criteria, staged rollout
27A model improves AUC by 0.5% offline but shows no online improvement. Why?Medium20 minDebuggingOnline/offline gap analysis: distribution shift, feature freshness, position bias, label delay

Section 4: Behavioral (3 Discussion Topics)

Meta behavioral rounds focus on impact, speed, and bold decision-making aligned with Meta's values.

#TopicTimeMeta Values Tested
28Tell me about a time you moved fast and broke something. What did you learn?15 minMove Fast: willingness to take risks, learn from failures, iterate quickly
29Describe a project where you had to make a decision with incomplete data15 minBe Bold: comfort with ambiguity, bias toward action, iterative approach
30Tell me about your most impactful ML project. Quantify the impact.15 minFocus on Impact: metrics-driven thinking, connecting ML to business outcomes

:::tip Meta Behavioral Preparation Meta values differ from Google's Googleyness:

  • Move Fast: Show you bias toward action over analysis paralysis
  • Be Bold: Show you take calculated risks rather than always playing it safe
  • Focus on Long-Term Impact: Show you think beyond the current quarter
  • Build Social Value: Show you consider societal impact of your work
  • Be Open: Show you share information and collaborate transparently

Every story should include quantified impact: "This improved CTR by X%, which translated to $Y revenue impact." :::

Meta-Specific Preparation Strategies

1. The Two-Problem Speed Drill

Meta coding rounds have two problems. Practice solving two medium problems back-to-back:

SessionProblem 1 (15 min)Problem 2 (20 min)Rest (10 min)
Day 1Two SumGroup AnagramsReview approach speed
Day 2Valid ParenthesesLRU CacheReview implementation speed
Day 3Binary SearchCourse ScheduleReview debugging speed
Day 4#1 (LRU Cache)#5 (Min Window Substring)Time yourself
Day 5#2 (Right Side View)#6 (Alien Dictionary)Target: <35 min total

2. Product ML Mapping

For every ML concept, connect it to a Meta product:

ML ConceptMeta Product Application
Collaborative filteringNews Feed ranking, Reels recommendation
Embedding modelsUser/content embeddings for retrieval
Multi-task learningPredicting multiple engagement signals (like, share, comment, hide)
CalibrationAd auction bid optimization
Online learningReal-time ranking signal updates
Anomaly detectionFake account detection, platform abuse
Multi-objective optimizationEngagement vs. integrity vs. time well spent
Graph neural networksSocial graph features for PYMK

3. Feature Engineering Depth

Meta interviewers spend significant time on feature engineering. Prepare detailed feature lists for recommendation systems:

User Features:
- Demographics (age bucket, country, language)
- Historical engagement (likes, comments, shares, hides \text{---} last 1h, 1d, 7d, 30d)
- Social graph (friend count, group memberships, page follows)
- Session features (time of day, device, connection type)
- Interest embeddings (learned from historical engagement)

Item Features:
- Content type (text, image, video, link)
- Creator features (follower count, post frequency, integrity score)
- Content embeddings (visual, textual, multimodal)
- Engagement statistics (historical CTR, share rate, hide rate)
- Freshness (time since posted)

Cross Features:
- User-creator affinity (historical interaction between this user and creator)
- Social proof (did friends engage with this content?)
- Topic match (user interest vs. content topic similarity)
- Contextual match (content type preference by time of day)

4-Week Meta Prep Plan

WeekFocusProblemsDaily Load
Week 1Coding speed drills#1-122-3 problems/day, timed
Week 2Product ML design#13-181 design/day
Week 3Ads/integrity/infra design#19-241 design/day
Week 4Applied ML + behavioral + mocks#25-30 + mocksMixed daily

Week 1: Coding Speed Drills

Day 1: #1, #2 (LRU Cache, Right Side View \text{---} target 30 min total)
Day 2: #3, #4 (Trie with wildcard, weighted random \text{---} target 35 min total)
Day 3: #5, #6 (Min window substring, alien dictionary \text{---} target 40 min total)
Day 4: #7, #8 (Batch GD, AUC-ROC \text{---} target 30 min total)
Day 5: #9, #10 (Feature hashing, online learning \text{---} target 30 min total)
Day 6: #11, #12 (Stratified sampling, cosine similarity \text{---} target 22 min total)
Day 7: Speed re-run: solve ALL 12 problems, targeting <4 hours total

Week 2: Product ML Design

Day 1: #13 (News Feed ranking \text{---} the canonical Meta problem)
Day 2: #14 (Instagram Reels \text{---} cold-start + engagement)
Day 3: #15 (PYMK \text{---} graph + ML combination)
Day 4: #16 (Marketplace ranking \text{---} location + trust)
Day 5: #17 (Instagram Explore \text{---} discovery + diversity)
Day 6: #18 (WhatsApp spam \text{---} behavioral signals without content)
Day 7: Re-do #13 and #14 from scratch under 45-min time pressure

Week 3: Ads, Integrity, Infrastructure

Day 1: #19 (Ads targeting \text{---} post-ATT privacy challenges)
Day 2: #20 (Ad auction \text{---} mechanism design + ML)
Day 3: #21 (Hate speech detection \text{---} precision/recall for content mod)
Day 4: #22 (Fake account detection \text{---} graph + behavioral)
Day 5: #23 (Real-time feature pipeline \text{---} freshness + consistency)
Day 6: #24 (Model serving at 1M QPS \text{---} optimization + batching)
Day 7: Review all designs; practice feature engineering depth

Week 4: Applied ML + Behavioral + Mocks

Day 1: #25 (Engagement vs. satisfaction diagnosis)
Day 2: #26 (Experiment design with integrity risks)
Day 3: #27 (Online/offline metric gap debugging)
Day 4: #28, #29, #30 (Behavioral stories \text{---} practice out loud)
Day 5: Full mock: 2 coding + 1 ML design + 1 behavioral
Day 6: Full mock: 2 coding + 1 ML design + 1 behavioral
Day 7: Final review; speed-run coding problems one more time

Problem Deep Dives

Problem 13: Design News Feed Ranking

Why Meta asks this: News Feed is Meta's core product. It touches every ML concept: recommendation, ranking, multi-objective optimization, real-time inference, and responsible AI. Your answer reveals your breadth and product intuition.

Architecture:

Content Pool -> Candidate Selection -> Scoring -> Ranking -> Integrity Filter -> User

1. Candidate Selection (~500 candidates from ~1500 eligible)
- Friends' posts (social graph filtering)
- Followed pages and groups
- Suggested content (explore-like discovery)
- Ads (separate auction winner injection)

2. Multi-Objective Scoring
- Predict P(like), P(comment), P(share), P(hide), P(report)
- Each prediction from a separate head on a shared backbone
- Combine: Score = w1*P(like) + w2*P(comment) + w3*P(share) - w4*P(hide) - w5*P(report)
- Weights tuned via online experiments

3. Model Architecture
- User tower: user features + history embeddings
- Content tower: content features + creator embeddings
- Interaction layer: cross-attention or factorization machines
- Multi-task heads: shared bottom, task-specific tops

4. Features (the differentiator in Meta interviews)
- User: engagement history, time patterns, device, social graph
- Content: type, creator, age, engagement rate, embedding
- Context: time, session depth, previous items shown
- Social: friend engagement signals, social proof

5. Responsible Ranking
- Integrity classifier demotes harmful content
- "Time well spent" objective alongside engagement
- Diversity injection to avoid filter bubbles
- Creator distribution fairness

6. Serving
- Score at impression time (real-time)
- Feature freshness: user features updated every few minutes
- Model refresh: daily or weekly depending on model size
- Latency budget: <100ms for full ranking

Problem 24: Design Model Serving at 1M QPS

Why Meta asks this: Meta serves ML predictions for every feed load, ad impression, and content decision. At Meta scale, model serving is an infrastructure challenge as much as an ML challenge.

Architecture:

1. Request Flow
Client -> Load Balancer -> Feature Assembly -> Model Inference -> Response

2. Feature Assembly (critical path)
- Online features: fetch from feature store (Redis/Memcache, <5ms)
- Real-time features: compute on-the-fly (session signals)
- Batch features: pre-computed, cached
- Feature assembly budget: <10ms

3. Model Inference
- Model format: TorchScript, TensorRT, or ONNX
- Optimization:
- Quantization (FP32 -> INT8, 2-4x speedup)
- Distillation (large teacher -> small student)
- Pruning (remove unimportant weights)
- Operator fusion (combine layers for fewer kernel launches)
- Batching: accumulate requests for GPU efficiency
- Inference budget: <10ms

4. Scaling Strategies
- Horizontal: more inference servers behind load balancer
- Model sharding: split model across GPUs for large models
- Caching: cache predictions for popular items (short TTL)
- Tiered models: lightweight model for most requests, heavy for high-value

5. Reliability
- Canary deployment: 1\% traffic to new model, monitor
- Fallback: if inference fails, serve from lightweight backup model
- Circuit breaker: if error rate > threshold, route to fallback
- Graceful degradation: drop optional features under load

Meta vs. Google Comparison

DimensionMeta StyleGoogle Style
Coding rounds2 problems/round (speed)1 problem + follow-ups (depth)
Approach discussionPick and execute fastDiscuss 2-3 approaches first
System designProduct-specific (Feed, Ads)Generic + scale (Search, YouTube)
Evaluation emphasisExecution speed, product senseClean code, multiple approaches
BehavioralImpact quantification, boldnessGoogleyness, humility, collaboration
Feature engineeringDeep detail expectedMentioned but less deep
Follow-upsNew problemHarder constraint on same problem

:::danger Common Reasons Meta Rejects Candidates

  1. Too slow on coding. Cannot complete two problems in 45 minutes.
  2. Generic system design. Describes a generic recommendation system without product specifics.
  3. Shallow feature engineering. Cannot name 20+ features for a recommendation system.
  4. No quantified impact. Cannot tie technical decisions to business outcomes.
  5. Over-thinking coding. Spends 10 minutes discussing approach when Meta wants you to code.
  6. Missing integrity considerations. Does not mention content safety, fake accounts, or adversarial users. :::

Difficulty Distribution

DifficultyProblemsCount
Easy#121
Medium#1, #2, #3, #4, #7, #8, #9, #10, #11, #15, #16, #17, #18, #21, #22, #25, #26, #2718
Hard#5, #6, #13, #14, #19, #20, #23, #248
Behavioral#28, #29, #303

Meta Interview Level Expectations

LevelCoding SpeedML Design DepthProduct IntuitionImpact Stories
E3 (IC3)2 mediums in 40 minBasic ML systemAwareness of product"I contributed to..."
E4 (IC4)1 medium + 1 hard in 42 minFull product ML systemConnect ML to product metrics"My work improved X by Y%"
E5 (IC5)2 problems incl. hard in 40 minProduct ML with depth + tradeoffsPropose product improvements"I led a project that..."
E6 (IC6)Clean, optimal, teach-levelNovel system with multi-team impactDefine product direction"I changed how the team..."

Problem Deep Dives (Continued)

Problem 1: LRU Cache (Meta Speed Version)

Why Meta asks this as a speed problem: At Meta, LRU Cache is not a 25-minute problem. It is a 15-minute warm-up problem. If you spend more than 15 minutes on it, you will not have time for the second problem. This means you must have the implementation memorized and be able to write it without thinking.

Speed Implementation Checklist:

[ ] Declare Node class (key, val, prev, next) \text{---} 30 seconds
[ ] Declare LRUCache with capacity, cache dict, head/tail dummies \text{---} 30 seconds
[ ] Wire head.next = tail, tail.prev = head \text{---} 10 seconds
[ ] Write _remove(node) helper \text{---} 30 seconds
[ ] Write _add_to_front(node) helper \text{---} 30 seconds
[ ] Write get(key) \text{---} 30 seconds
[ ] Write put(key, value) \text{---} 60 seconds
[ ] Test with example \text{---} 60 seconds
Total: ~5 minutes if practiced

Speed Practice Drill:

  1. Set a 5-minute timer
  2. Implement LRU Cache from scratch (no reference)
  3. Test with: put(1,1), put(2,2), get(1), put(3,3), get(2)
  4. If it takes >8 minutes, practice again tomorrow

Problem 4: Random Pick with Weight

Why Meta asks this: Weighted random selection is fundamental to ad serving, A/B test traffic allocation, and content sampling. This problem tests whether you understand prefix sums and binary search for probability distributions.

Solution:

import random
import bisect

class WeightedRandom:
def __init__(self, weights):
self.prefix = []
running = 0
for w in weights:
running += w
self.prefix.append(running)
self.total = running

def pick(self):
target = random.random() * self.total
return bisect.bisect_left(self.prefix, target)

Key Insight: Build a prefix sum of weights. Generate a random number in [0, total_weight). Binary search for where it falls in the prefix array. This gives O(log n) picks after O(n) preprocessing.

Follow-Up Questions:

  • What if weights change frequently? (Use a Fenwick tree / BIT for O(log n) updates)
  • What if you need to sample without replacement? (Reservoir sampling with weighted version)
  • How does this relate to Meta's ad auction? (Weighted sampling for ad candidate selection)

Problem 9: Implement Feature Hashing (Hashing Trick)

Why Meta asks this: Feature hashing is used extensively in Meta's ads and ranking systems to handle high-cardinality categorical features without maintaining a full vocabulary.

Solution:

import hashlib
import numpy as np

def feature_hash(features, n_buckets=2**20):
"""
Hash feature names + values into a fixed-size vector.
Handles collision by signed hashing (reduces bias).
"""
vector = np.zeros(n_buckets)

for name, value in features.items():
# Hash to bucket index
key = f"{name}={value}"
hash_val = int(hashlib.md5(key.encode()).hexdigest(), 16)
bucket = hash_val \% n_buckets

# Signed hashing: use another hash to determine sign
sign_hash = int(hashlib.sha1(key.encode()).hexdigest(), 16)
sign = 1 if sign_hash \% 2 == 0 else -1

vector[bucket] += sign

return vector

Why Signed Hashing Matters:

  • Without sign: hash collisions always add, creating a positive bias
  • With sign: collisions cancel on average (E[error] = 0), maintaining unbiasedness
  • This is the Weinberger et al. (2009) hashing trick

Interview Discussion Points:

  • Tradeoff: more buckets = fewer collisions but more memory
  • Feature hashing vs. vocabulary-based encoding: no OOV problem, but collisions lose information
  • Used in Vowpal Wabbit, scikit-learn's HashingVectorizer, and Meta's ads stack

Problem 14: Design Instagram Reels Recommendation

Why Meta asks this: Reels is Meta's fastest-growing product and a direct competitor to TikTok. The recommendation system must handle cold-start content, optimize for watch time, and maintain creator ecosystem health.

Architecture:

Content Pool -> Candidate Generation -> Ranking -> Reranking -> Serving

1. Candidate Generation (multiple sources)
a. Interest-based: user embedding -> ANN search in content embedding space
b. Social: content engaged by friends/following
c. Trending: globally popular content in user's locale
d. Creator-based: new content from followed creators
e. Exploration: random high-quality content for discovery
Total: ~500-1000 candidates

2. Ranking Model
- Architecture: Two-tower for retrieval, cross-attention for ranking
- Multi-objective prediction:
P(finish_watch), P(like), P(share), P(comment), P(follow_creator)
P(not_interested), P(report)
- Labels:
- Positive: watch >80\% of video, like, share
- Negative: skip within 2s, "not interested", report
- Key feature: watch-through rate, not just click-through rate

3. Cold-Start Handling (Critical for Reels)
- Content cold-start: embed using visual + audio + text features
- Exploration allocation: reserve 10-20\% of inventory for new content
- Creator cold-start: boost new creators' content in explore
- Progressive rollout: show to small audience, measure engagement, expand

4. Reranking Layer
- Diversity: no two consecutive reels from same creator
- Pacing: mix content types (funny, educational, music, news)
- Creator fairness: minimum impression share for active creators
- Integrity: filter harmful, misleading, or IP-violating content

5. Engagement vs. Well-Being
- "Take a Break" reminders after extended sessions
- Reduce recommendation of divisive content
- Balance addictiveness with responsible design

Key Metrics:

Metric TypeMetricWhy It Matters
PrimaryDaily active users on ReelsUser adoption
EngagementAverage watch time per sessionContent quality
ContentWatch-through rate (finish watching a reel)Better than CTR for video
CreatorCreator retention rateHealthy creator ecosystem
Integrity% content flagged by usersPlatform safety
GuardrailSession length distribution (not just mean)Prevent unhealthy usage patterns

Meta Interview Timeline

Understanding Meta's interview process helps with scheduling and preparation:

StageTimelineWhat Happens
Recruiter screenDay 030-min call; background, motivation, role fit
Technical phone screenDay 7-1445-min coding (1-2 problems)
Phone screen resultDay 14-21Pass/fail notification
Onsite schedulingDay 21-354-5 rounds scheduled over 1-2 days (virtual or in-person)
Onsite interviewsDay 35-50Full loop: 2 coding + 1 ML design + 1 behavioral
Hiring committeeDay 50-65Committee reviews packet; may request additional signal
OfferDay 65-80Compensation discussion and offer letter

:::tip Meta Timeline Optimization

  • Ask the recruiter for the exact interview format (it varies by team)
  • Request at least 3 weeks between phone screen and onsite
  • If you pass the phone screen, the hardest part is behind you (phone screen has the highest bar)
  • Meta often lets you choose your preferred coding language :::

Meta-Specific ML Concepts to Study

Concepts that appear disproportionately in Meta interviews:

ConceptWhy Meta CaresWhere It Appears
Multi-task learningOne model predicting multiple engagement signalsFeed ranking, Reels, Ads
Feature interaction modelingDeepFM, DCN for cross-feature learningAds, Recommendations
Embedding-based retrievalTwo-tower models for candidate generationAll recommendation systems
CalibrationPredicted probabilities must match actual ratesAd auction (bid optimization)
Position bias correctionUsers click more on items shown firstFeed ranking training data
Label definitionWhat counts as positive/negative engagementFeed, Reels, Marketplace
Multi-objective optimizationBalancing engagement, integrity, time well spentPlatform-level optimization
Approximate nearest neighborFast embedding lookup for billions of itemsFAISS (Meta's ANN library)
Online learningModel updates from real-time engagement signalsAds, trending content
Privacy-preserving MLPost-ATT ad targeting without cross-app trackingAds targeting

Meta Behavioral Story Bank

Prepare 5-7 stories that can be adapted to any Meta behavioral question:

Story TemplateMaps ToKey Element
"I shipped X in Y weeks, which improved Z by N%"Move Fast, ImpactSpeed + quantified outcome
"I proposed an unconventional approach that the team initially resisted"Be BoldData-driven persuasion, willingness to take risk
"I noticed a fairness issue and raised it despite pressure to ship"Build Social ValueEthical decision-making, proactive identification
"I simplified a complex system from N components to M"Move FastReducing complexity, not just adding features
"I collaborated with team X to solve a problem neither could solve alone"Be OpenCross-team collaboration, shared ownership
"I invested time in infrastructure that paid off 6 months later"Long-Term ImpactStrategic thinking beyond quarterly goals
"I made a mistake, here's what I learned and changed"Growth mindsetAccountability, learning, process improvement

Meta Offer Calibration (ML Roles, 2024-2025)

Approximate total compensation ranges to calibrate expectations (US, Bay Area):

LevelTitleBaseBonusRSU (Annual)Total Comp
E3IC3 (SWE)$130-160K10%$50-100K$200-280K
E4IC4 (SWE)$160-190K15%$100-200K$290-430K
E5IC5 (Senior)$190-230K15%$200-350K$430-620K
E6IC6 (Staff)$230-280K20%$350-600K$650-950K

:::note Compensation notes:

  • RSUs vest over 4 years with annual refreshers
  • ML roles sometimes receive a premium over general SWE
  • Research Scientist roles may be leveled differently
  • These ranges shift annually; verify with current data :::

Next Steps

After completing Meta-Style preparation:

© 2026 EngineersOfAI. All rights reserved.