Seeking NLP expertise for search query intent classification system

50 views

Skip to first unread message

Ale Garcia

unread,

Mar 16, 2026, 5:06:42 PMMar 16

to nltk-users

Hi NLTK community,

I'm building a market intelligence platform that classifies search query intent (informational, navigational, transactional, research, pre-purchase) and detects emerging market signals from Google Ads data.

Key challenges:

Intent classification on short/ambiguous queries (≤3 tokens, 30% of data)
Limited labeled data (targeting 10k examples, F1 ≥ 0.75)
Real-time processing requirements (p50 ≤ 200ms)
Current approach:

Distilled sentence transformers + MLP classifier
Hybrid rule-based fallback for explicit signals
Active learning loop for low-confidence predictions
Questions for the community:

Best practices for handling short query ambiguity with limited labels?
Recommended weak supervision techniques for bootstrapping intent classifiers?
Evaluation strategies beyond standard F1 for market detection quality?
NLTK tools that complement modern transformer approaches for this use case?
Happy to share more details about the system architecture and data schema. Thanks in advance for any insights!

Reply all

Reply to author

Forward

0 new messages