OpinionFlow – Real-Time Review Intelligence with Bright Data, Gemini 2.0 & Pinecone
Shivansh Singh

Shivansh Singh @shivanshsinghh

About: Data engineer & AI enthusiast | Graduate student at Northeastern University

Location:
Boston, Massachusetts
Joined:
Mar 31, 2024

OpinionFlow – Real-Time Review Intelligence with Bright Data, Gemini 2.0 & Pinecone

Publish Date: May 26
353 77

This is a submission for the Bright Data AI Web Access Hackathon


🔥 What I Built

OpinionFlow helps users skip the endless scroll. It's an AI-powered review assistant that:

  • Crawls live product reviews from Amazon and Walmart using Bright Data MCP
  • Summarizes insights using Gemini Flash
  • Caches results semantically using Pinecone
  • Lets users ask product-specific questions via LangChain

Want to know if the AirPods Pro have battery issues? Just ask — you’ll get instant, evidence-backed answers.


📌 Demo


🧠 How It Works

Step Description
🔍 Search Users enter a query (e.g., "Noise Buds X Prime")
📄 Crawl Bright Data scrapes live reviews from both stores
Summarize Gemini Flash extracts sentiment, pros/cons, and key specs
🧠 Cache MiniLM embeddings stored in Pinecone to avoid repetition
💬 Answer LangChain generates natural-language responses with citations

Product discovery UI


🧩 Features at a Glance

Feature Description
💬 Instant AI Answer Box Summary with links to real reviews
👍 Top Pros / Cons Highlighted from verified buyers
📊 Sentiment Comparison Side-by-side scores from Amazon & Walmart
🏷️ Aspect Mini-Charts Dynamic breakdowns: battery, comfort, etc.
🧭 Multi-Store Tabs Compare similar SKUs across platforms
🔎 Review Explorer Drill down into sources and keywords

Sentiment analysis


🚀 Bright Data in Action

MCP Tool Role
SERP API Fetches product listings via Google Search
Web Unlocker Unblocks product pages seamlessly
Scraping Browser Renders full review sections
Browser API + Playwright Handles dynamic navigation like "See all reviews"

Bright Data saved me dozens of hours — no captchas, no proxy headaches, just clean data.


🧠 Architecture Overview

System Architecture Diagram

OpinionFlow is built with a modular microservice-style architecture with FastAPI as the core backend. Key flows include:

  • Product discovery → via Bright Data SERP API
  • Review scraping → using custom extractors
  • Analysis → powered by Gemini Flash
  • Semantic caching → via Pinecone vector search
  • Natural Q&A → powered by LangChain and Gemini

🧰 Tech Stack

FastAPI, React.js, Bright Data MCP, Gemini Flash, Pinecone, LangChain, HuggingFace MiniLM, Netlify, Cloud Run


🔮 What's Next

  • Add Target.com integration
  • Launch a real-time price tracker
  • Let users add their own reviews
  • Enable Gemini-powered follow-ups in chat

🙏 Thanks

Big thanks to Bright Data and DEV for the challenge. If you liked this project, consider checking out:

🌟 github.com/luminati-io/brightdata-mcp

Built with ❤️ by @shivanshsinghh


🏷️ Tags

#BrightData #Hackathon #AI #LLM #Gemini #LangChain #Ecommerce #ProductReviews #FastAPI #React #Pinecone #SemanticSearch

Comments 77 total

  • Amit Kashyap
    Amit KashyapMay 26, 2025

    How you integrated Amazon review scrapping in this? BTW project looks solid.

  • Jenny Sa
    Jenny SaMay 26, 2025

    Good work

  • Dotallio
    DotallioMay 26, 2025

    Super cool seeing live reviews merged with quick AI insights like this. Any plans to add more stores or deeper personalization next?

    • Shivansh Singh
      Shivansh SinghMay 27, 2025

      Thank you so much for your comment. Yes I am thinking to add more stores and do deeper personalizations in this project. This project really resonates with me.

  • Bahman Simon
    Bahman SimonMay 27, 2025

    Great work shivansh, thanks for sharing this

    • Shivansh Singh
      Shivansh SinghJun 2, 2025

      Really appreciate that - it was a fun challenge putting all together.

  • Nathan Tarbert
    Nathan TarbertMay 27, 2025

    pretty cool seeing tools get tied together like this - makes me curious, you think momentum on stuff like this depends more on habits or just chasing little wins day by day?

    • Shivansh Singh
      Shivansh SinghMay 28, 2025

      Thanks! Honestly, I think it's a mix of both - I definitely chase small wins to stay motivated, but a lot of it also just comes from long-term habits. I've been coding since I was a kid, so some of that momentum is just muscle memory at this point.

  • Nitya Bhardwaj
    Nitya BhardwajMay 28, 2025

    This is such a cool use of Ai and web scrapping.. something like this can save so much time and efforts of users. Great stuff man!

    • Shivansh Singh
      Shivansh SinghMay 28, 2025

      I'm glad you liked it! Thanks for commenting.

  • Divyansh Singh
    Divyansh SinghMay 28, 2025

    Top-notch work! Sir

  • АнонимMay 28, 2025

    [hidden by post author]

  • Shagun Singh
    Shagun SinghMay 29, 2025

    Great work as always!!

    • Shivansh Singh
      Shivansh SinghJun 2, 2025

      Thanks for always showing up and supporting. Means a lot!

  • Scott Peralta
    Scott PeraltaMay 29, 2025

    Fabulous this is! Such a life saver. Good work!

    • Shivansh Singh
      Shivansh SinghJun 2, 2025

      Thank you so much! Glad to hear it's helpful to you. Let me know if you try it out!

  • Yixuan He
    Yixuan HeMay 31, 2025

    Sometimes the website shows, error loading analysis

    • 力 叶
      力 叶Jun 1, 2025

      Happened with me too! But then I guess now it's working.

  • Nancy Alave
    Nancy AlaveJun 1, 2025

    This is incredibly helpful for comparison shoppers like me! Any plans to add Flipkart or BestBuy next?

    • Shivansh Singh
      Shivansh SinghJun 2, 2025

      The scraping structure is modular, so Flipkart and BestBuy are 90% ready - just need to fine-tune HTML selectors and test a few edge cases.

  • Philip M Posmyk
    Philip M PosmykJun 1, 2025

    Tried searching for 'MacBook Air' — worked beautifully. Would be great to export reviews too!

    • Shivansh Singh
      Shivansh SinghJun 2, 2025

      Great to hear that! Expoert is a solid idea - CSV or PDF maybe? I'll see how I can fit it into the next sprint.

  • 力 叶
    力 叶Jun 1, 2025

    Would love to see how it handles products with thousands of variants. Great job Shivansh!

    • Shivansh Singh
      Shivansh SinghJun 2, 2025

      That's definitely a trick part! Right now, it clusters reviews at the product level, but doesn't differentiate by variant (like size or color).

  • Dalimin Iswahyudi
    Dalimin Iswahyudi Jun 1, 2025

    Honestly didn’t think something like this could be built in a hackathon timeframe. Hats off!

  • Estêvão Delvalle
    Estêvão DelvalleJun 1, 2025

    Maybe in future versions, you could allow users to contribute their own reviews and see how Gemini classifies them?”

    • Shivansh Singh
      Shivansh SinghJun 2, 2025

      Ooh I really like that! Like a sandbox mode for testing your own feedback - thanks for the idea, bookmarking it.

  • Victor Svensson
    Victor SvenssonJun 1, 2025

    Brooo this is insane 🔥🔥 I remember when you were just sketching this out — now it’s a full-blown AI system with caching and scraping and everything. Let’s gooo!

    • Shivansh Singh
      Shivansh SinghJun 2, 2025

      Man that means a ton! You've seen the whole journey, from doodles to delivery. Appreciate you cheering me on the way!

  • Süleyman Öztürk
    Süleyman ÖztürkJun 1, 2025

    Just used the live demo — the way it breaks down pros/cons and sentiment across stores is honestly more helpful than most YouTube review videos lol.

    • Shivansh Singh
      Shivansh SinghJun 2, 2025

      Hahah I'll take that as a big win. Glad you liked it!

  • Fletcher Mitchell
    Fletcher MitchellJun 1, 2025

    One suggestion: showing the last scraped timestamp for each store would build a lot of user trust. Just a little ‘Fresh as of…’ badge maybe?

    • Shivansh Singh
      Shivansh SinghJun 2, 2025

      Love this and totally agree. Adding a "Last updated: X mins ago" badge per store is actually super simple to implement with current cache structure. Will try to ship that next !

  • Toni Raić-Sudar
    Toni Raić-SudarJun 1, 2025

    That semantic caching with Hugging Face is super interesting. Do you embed each review separately or batch them by product?

    • Shivansh Singh
      Shivansh SinghJun 2, 2025

      Great question - I batch them by product for now, since it makes Gemini's job easier too. Thinking of switching to per-review embedding for more granular analysis later on.

  • Lucas Lambert
    Lucas LambertJun 1, 2025

    Loved the demo! Would be cool to know how Pinecone handles paraphrased or fuzzy queries. Does it match on semantic similarity only?

    • Shivansh Singh
      Shivansh SinghJun 2, 2025

      Yes exactly - it works on embeddings, so even if oyu say "best wireless earbuds" and someone says "top bluetooth headphones", it knows to match based on meaning, not words. Super powerful!

  • Tugiman Tampubolon
    Tugiman TampubolonJun 1, 2025

    Quick question: does the Gemini analysis handle multilingual reviews? Or is it limited to English for now?

    • Shivansh Singh
      Shivansh SinghJun 2, 2025

      Currently it's mostly tuned for English, though Gemini does handle multilingual reasonably well. Planning to test with Hindi and Spanish next to see how it holds up.

  • Joshua Albrecht
    Joshua AlbrechtJun 1, 2025

    One idea: could be awesome to let users upvote or bookmark the most relevant insights per product. Adds a human layer on top of AI

    • Shivansh Singh
      Shivansh SinghJun 2, 2025

      Yes! That would make it more community driven and help get most out of it.

  • เขมจิรา เกตุอารี
    เขมจิรา เกตุอารีJun 1, 2025

    Absolutely loved the clarity of the summaries — way better than skimming 100+ reviews. You’ve nailed both utility and presentation

    • Shivansh Singh
      Shivansh SinghJun 2, 2025

      Thanks so much - took a tweaking to make Gemini return clean and accurate summaries. Glad it's working as intended!

  • Ivan Marin
    Ivan MarinJun 1, 2025

    This is the kind of thing I’d actually use before buying anything online. Let me know when you launch a full version!

    • Shivansh Singh
      Shivansh SinghJun 2, 2025

      That made my day! I;m polishing the UI and will definitely drop an update once it's product-ready.

  • Охинбархаг Doe
    Охинбархаг DoeJun 1, 2025

    Хөөе энэ үнэхээр гайхалтай.

    • Shivansh Singh
      Shivansh SinghJun 2, 2025

      Баярлалаа! Thank you so much — appreciate the love!

  • BackSlash Flutter
    BackSlash FlutterJun 1, 2025

    Great work

    • Shivansh Singh
      Shivansh SinghJun 2, 2025

      Thank you! Really appreciate your time to check this out

  • Pedro Rodriguez
    Pedro RodriguezJun 1, 2025

    What stood out to me is how you have aligned semantic caching with product discovery - it felt like the kind of detail that only comes from actually building and testing products deeply.

    Thank you again!

    • Shivansh Singh
      Shivansh SinghJun 2, 2025

      Woow, appreciate that insight - yes caching was the trickiest part in this. Took a few iterations to get the key structure right. Pinecone is great too

  • Valbona Sefa
    Valbona SefaJun 1, 2025

    One thing I’d love to see is a timeline feature — like how sentiment shifts over time for a product as updates roll out or versions change.

    • Shivansh Singh
      Shivansh SinghJun 2, 2025

      Yes yes yes — time-based review analysis is one of my favorite ideas. Currently the backend already stores timestamps per review, so i can do sentiment shifts over weeks/months. Thanks for commenting

  • Andres Kuznetsov
    Andres KuznetsovJun 1, 2025

    Would it be possible to let users upload a product URL from any store and get analysis instantly? That would make this even more flexible.

    • Shivansh Singh
      Shivansh SinghJun 2, 2025

      Yes! I've already built in support for direct URLs. Would be live soon!

  • Panna Antal
    Panna AntalJun 1, 2025

    I didn’t even know Pinecone could be used like this. Thanks for showcasing how semantic search can power something practical like this.

    • Shivansh Singh
      Shivansh SinghJun 2, 2025

      Absolutely! That's what I was aiming for - real-world usage that just goes beyond the demos. Happy you found it interesting!

  • Antonio Fiore
    Antonio FioreJun 1, 2025

    Honestly surprised at how polished this is. The backend flow makes sense and the front-end is clean too. Respect.

    • Shivansh Singh
      Shivansh SinghJun 2, 2025

      That means a lot - thank you! It took a lot of iterations to get it this clean. Happy the backend/frontend flow felt smooth to you.

  • John Montahar
    John MontaharJun 2, 2025

    Great work. I also applied to this hackathon, but couldn't finalize the end product. Great to see how you finalized this project.

    • Shivansh Singh
      Shivansh SinghJun 2, 2025

      Totally get that - sometimes it just doesn't click in time. Hope you jump back into the next one!

  • Matilda Sobotková
    Matilda SobotkováJun 2, 2025

    Was it tough to parse the HTML across different stores like Walmart and Amazon?

    • Shivansh Singh
      Shivansh SinghJun 2, 2025

      100%. Every store has different DOMs, lazy loading, and anti-bot tricks. The use of browser api, and other servies from Brightdata made it easy to extract information.

  • Вениамин Смирнов
    Вениамин СмирновJun 2, 2025

    Оцените пошаговое руководство. Пользовательский интерфейс интуитивно понятен и не кажется перегруженным.

    • Shivansh Singh
      Shivansh SinghJun 2, 2025

      Большое спасибо! Appreciate the feedback — I worked hard to keep the UI clean and beginner-friendly.

  • Simone Brandt
    Simone BrandtJun 2, 2025

    Curious how LangChain fits into the final RAG flow — are you using custom retrievers?

    • Shivansh Singh
      Shivansh SinghJun 2, 2025

      Yes- I'm using Langchain to orchestrate the retrieval and prompting. The retriever pulls from Pinecone using cosine similarity and then fiilters based on metadata like store+timestamp before the Gemini call.

  • Cyril de Cock
    Cyril de CockJun 2, 2025

    Just saw your linkedin post, and wanted to really thank you for this creation.

  • Eliah Cornelis
    Eliah CornelisJun 4, 2025

    Question: How do you manage inconsistencies in review formats between stores? I imagine Amazon and Walmart have very different structures.

  • Sofia Meldgaard
    Sofia MeldgaardJun 4, 2025

    This feels like ChatGPT with a memory — but for ecommerce reviews. Great job!

  • René Šimek
    René ŠimekJun 4, 2025

    Could be intereting to integrate price tracking next. This way intelligence + price tracking would be really good.

  • Tomas Reuter
    Tomas ReuterJun 4, 2025

    Would love to see multilingual support in future, you are currently looking for amazon.com but they have different variants for different countries. So maybe you can try to look at or detect user's location and then connect to amazon or walmart. BTW! really great work.

  • Michalina Laskowska
    Michalina LaskowskaJun 4, 2025

    The way it summarizes real user sentiment into a one-liner is impressive. Curious — how long does it take from query to final answer?

  • Mariana Silva
    Mariana SilvaJun 4, 2025

    Great project, wishing you luck for this hackathon!

  • Jorge Valencia
    Jorge Valencia Jun 4, 2025

    Super clean interface - loved it

Add comment