About: 👨‍💻 Full-time Software Developer | 📹 YouTuber @programmingwithsuman2300 | 💡 Passionate Educator
Thanks! 😊 To keep data fresh, we don’t use a database at all: everything is scraped live through Bright Data’s Fast MCP server. It lets us simulate real user behavior (search, click, wait), so each query pulls the latest news. LangChain handles query routing, and switching to MCP improved performance massively over our old Puppeteer + proxy setup.
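For readers wondering what "route the query, then scrape live" can look like, here is a rough plain-Python sketch. It is not the project's code and does not use the LangChain or Bright Data APIs: `ROUTES`, `route_query`, and `answer` are hypothetical names, the keyword lists are made up, and the scrapers are assumed to be ordinary callables supplied by the caller.

```python
from typing import Callable, Dict, Tuple

# Hypothetical topic keywords; a real router could use an LLM or a classifier instead.
ROUTES: Dict[str, Tuple[str, ...]] = {
    "finance": ("stock", "market", "earnings"),
    "sports": ("match", "score", "league"),
}


def route_query(query: str) -> str:
    """Return the first route whose keywords appear as whole words in the query."""
    words = query.lower().split()
    for route, keywords in ROUTES.items():
        if any(keyword in words for keyword in keywords):
            return route
    return "general"  # fallback when no keyword matches


def answer(query: str, scrapers: Dict[str, Callable[[str], str]]) -> str:
    """Pick a scraper by route and call it on every request (no cache, no database)."""
    return scrapers[route_query(query)](query)
```

Because `answer` calls the scraper on every request, freshness comes from the fetch itself rather than from any stored copy, which is the trade-off the reply describes.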
Pretty cool project tbh, but I always wonder if stuff like this keeps improving or loses steam after the first hype. Do you think tools like this stick around long-term, or do most vanish?
Absolutely, and that’s something we’ve thought about too! We’re also planning to integrate a vector database to enable smarter, semantic searches across analyzed news content. That way, users can explore trends and narratives over time, not just in real time but in context as well.
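To make the semantic-search idea a bit more concrete, here is a toy sketch of ranking stored articles by similarity to a query. It is an illustration under loud assumptions: a real setup would use learned embeddings and an actual vector database, whereas this stand-in uses plain word counts and an in-memory list. The names `embed`, `cosine`, and `search` are hypothetical.

```python
import math
from collections import Counter
from typing import List


def embed(text: str) -> Counter:
    """Toy 'embedding': lowercase word counts (a stand-in for a learned vector)."""
    return Counter(text.lower().split())


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse word-count vectors."""
    dot = sum(a[word] * b[word] for word in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    norm = norm_a * norm_b
    return dot / norm if norm else 0.0


def search(query: str, docs: List[str], top_k: int = 2) -> List[str]:
    """Return the top_k documents most similar to the query."""
    q = embed(query)
    ranked = sorted(docs, key=lambda doc: cosine(q, embed(doc)), reverse=True)
    return ranked[:top_k]
```

Swapping `embed` for a real embedding model and the list for a vector store would keep the same shape: embed the query, compare against stored vectors, return the closest matches.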
Real-time news analysis with LLMs is so cool, especially with the constant stream of updates. Super interested to know how you handled challenges around keeping the data fresh and relevant - was there a specific framework or tool that helped the most?