Introduction and Conceptual Overview
A meta search engine for price comparison aggregates product pricing information from multiple e-commerce platforms to help users find the best deals. When enhanced with Large Language Models (LLMs), such a system becomes significantly more powerful in understanding user intent, processing unstructured data, and providing intelligent recommendations. Unlike traditional price comparison tools that rely on rigid keyword matching and predefined product categories, an LLM-powered system can interpret natural language queries, understand product variations, and adapt to different merchant formats dynamically.
The core challenge in building such a system lies in handling the inherent messiness and variability of e-commerce data. Product titles vary wildly between merchants, specifications are presented in different formats, and user queries often contain ambiguous or incomplete information. For example, a user searching for “Braun shaver” might be looking for an electric razor, but different retailers might list the same product as “Braun Electric Shaver Series 9”, “Braun Men’s Electric Razor”, or simply “Series 9 Pro”. An LLM can bridge these semantic gaps by understanding the relationships between these different product descriptions.
The integration of LLMs into price comparison systems offers several distinct advantages over traditional approaches. First, LLMs can process natural language queries without requiring users to learn specific search syntax or navigate complex category hierarchies. Second, they can extract structured information from unstructured product descriptions, handling variations in how merchants present technical specifications. Third, they can make intelligent decisions about product equivalency across different sites, understanding when products with different names or descriptions are actually the same item.
However, building such a system also introduces unique challenges. LLMs require careful prompt engineering to produce consistent and accurate results. The system must handle rate limiting and cost considerations when making numerous API calls. Additionally, the non-deterministic nature of LLM responses requires robust error handling and validation mechanisms to ensure reliable operation.
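One way to make the system robust against non-deterministic LLM output is to validate every response before it enters the pipeline and retry on failure. The sketch below is a minimal illustration of that idea; `call_fn` is a hypothetical zero-argument wrapper around the actual API call, not part of any specific SDK.

```python
import json
from typing import Any, Callable, Dict, Set

def call_llm_with_validation(call_fn: Callable[[], str],
                             required_keys: Set[str],
                             max_retries: int = 3) -> Dict[str, Any]:
    """Call an LLM and retry until the response parses as JSON
    containing every required key."""
    last_error = None
    for _ in range(max_retries):
        raw = call_fn()
        try:
            data = json.loads(raw)
        except json.JSONDecodeError as e:
            last_error = e
            continue
        missing = required_keys - data.keys()
        if missing:
            last_error = ValueError(f"missing keys: {missing}")
            continue
        return data
    raise RuntimeError(f"Invalid LLM response after {max_retries} attempts: {last_error}")
```

Centralizing validation like this means every downstream component can assume well-formed input, and a single malformed response costs one retry rather than a crash.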
System Architecture and Component Design
The architecture of an LLM-powered price comparison system consists of several interconnected components that work together to deliver accurate and timely price comparisons. The system follows a pipeline architecture where each component has specific responsibilities and well-defined interfaces for data exchange.
The entry point is a query processing layer that leverages an LLM to understand and normalize user input. This component transforms natural language queries into structured search parameters that can be used by downstream components. The processed query then feeds into a web scraping orchestrator that coordinates multiple specialized scrapers targeting different e-commerce platforms.
The scraping layer operates in parallel, with each scraper designed to handle the specific quirks and anti-bot measures of individual merchant sites. Raw scraped data flows into a data processing pipeline where another LLM assists in extracting structured product information from unstructured HTML content. This extracted data undergoes normalization and deduplication before being stored in a temporary cache for comparison.
The comparison engine then evaluates the normalized product data, applying various matching algorithms enhanced by LLM reasoning to ensure that equivalent products are being compared across different merchants. Finally, a ranking and presentation layer organizes the results, potentially using the LLM again to generate natural language explanations of price differences or product variations.
This architecture emphasizes modularity and scalability. Each component can be developed, tested, and deployed independently. The use of well-defined data structures for inter-component communication ensures that improvements to individual components don’t require changes throughout the entire system.
Query Processing with LLM Integration
The query processing component serves as the critical entry point where natural language user input gets transformed into structured search parameters. This transformation process requires sophisticated understanding of product categories, brand relationships, and technical specifications that traditional keyword-based systems struggle to handle effectively.
The following code example demonstrates how an LLM can process and enhance user queries:
import openai
import json
from dataclasses import dataclass
from typing import List, Optional, Dict

@dataclass
class SearchQuery:
    product_type: str
    brand: Optional[str]
    model: Optional[str]
    specifications: Dict[str, str]
    price_range: Optional[tuple]
    keywords: List[str]

class QueryProcessor:
    def __init__(self, llm_client):
        self.llm_client = llm_client

    def process_query(self, user_input: str) -> SearchQuery:
        prompt = f"""
        Analyze this product search query: "{user_input}"

        Extract and structure the following information:
        1. Product type/category
        2. Brand name (if mentioned)
        3. Model number/name (if specified)
        4. Technical specifications (capacity, size, color, etc.)
        5. Price range (if mentioned)
        6. Additional relevant keywords

        Return the result as JSON with the following structure:
        {{
            "product_type": "category name",
            "brand": "brand name or null",
            "model": "model info or null",
            "specifications": {{"key": "value"}},
            "price_range": [min, max] or null,
            "keywords": ["keyword1", "keyword2"]
        }}
        """

        response = self.llm_client.chat.completions.create(
            model="gpt-4",
            messages=[{"role": "user", "content": prompt}],
            temperature=0.1
        )

        parsed_data = json.loads(response.choices[0].message.content)

        return SearchQuery(
            product_type=parsed_data["product_type"],
            brand=parsed_data.get("brand"),
            model=parsed_data.get("model"),
            specifications=parsed_data.get("specifications", {}),
            price_range=tuple(parsed_data["price_range"]) if parsed_data.get("price_range") else None,
            keywords=parsed_data.get("keywords", [])
        )
This code example illustrates how the LLM transforms ambiguous user input into structured data that can drive the rest of the search process. The QueryProcessor class encapsulates the interaction with the LLM, using a carefully crafted prompt that guides the model to extract specific types of information. The low temperature setting of 0.1 biases the model toward consistent, focused responses rather than creative variations, which helps maintain system reliability.
The structured SearchQuery object that results from this processing contains all the essential elements needed for effective product matching across different merchant sites. The product_type field helps target appropriate scraping modules, while brand and model information enables precise matching. The specifications dictionary captures technical details that vary between product variants, and the keywords list provides fallback search terms for broader matching when exact matches aren’t found.
The prompt design is particularly important in this implementation. It provides clear instructions about the expected output format and explicitly requests JSON structure, making the response easy to parse programmatically. The prompt also handles cases where information might be missing by allowing null values, ensuring the system remains robust when processing incomplete queries.
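Even with an explicit JSON instruction, models sometimes wrap their answer in markdown code fences or surround it with prose, which makes a plain `json.loads` fail. A small defensive parser, sketched below as an illustrative helper (the name `parse_llm_json` is not from the original code), makes the pipeline tolerant of these common formatting quirks:

```python
import json
import re

def parse_llm_json(raw: str):
    """Parse JSON from an LLM response, tolerating markdown code
    fences and surrounding prose that models sometimes emit."""
    # Prefer the contents of a ```json ... ``` fence if one is present
    fenced = re.search(r"```(?:json)?\s*(.*?)```", raw, re.DOTALL)
    candidate = fenced.group(1) if fenced else raw
    # Fall back to the outermost brace pair
    start, end = candidate.find("{"), candidate.rfind("}")
    if start != -1 and end > start:
        candidate = candidate[start:end + 1]
    return json.loads(candidate)
```

Dropping this in wherever the code calls `json.loads(response.choices[0].message.content)` removes a whole class of avoidable parse failures.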
Web Scraping Infrastructure
The web scraping engine represents the data acquisition backbone of the price comparison system. Unlike monolithic scrapers that attempt to handle all sites uniformly, a robust meta search engine employs specialized scrapers tailored to individual merchant architectures. Each scraper must navigate unique page structures, handle different anti-bot measures, and adapt to frequent layout changes that are common in e-commerce sites.
The following code demonstrates a scraper orchestrator that manages multiple specialized scraping modules:
import asyncio
import aiohttp
import logging
from abc import ABC, abstractmethod
from typing import List, Dict, Any
from urllib.parse import urlencode

class BaseScraper(ABC):
    def __init__(self, session: aiohttp.ClientSession):
        self.session = session
        self.headers = {
            'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36'
        }

    @abstractmethod
    async def search_products(self, query: SearchQuery) -> List[Dict[str, Any]]:
        pass

    @abstractmethod
    def get_site_name(self) -> str:
        pass

class AmazonScraper(BaseScraper):
    async def search_products(self, query: SearchQuery) -> List[Dict[str, Any]]:
        search_url = self._build_search_url(query)
        try:
            async with self.session.get(search_url, headers=self.headers) as response:
                html_content = await response.text()
                return self._parse_search_results(html_content)
        except Exception as e:
            logging.error(f"Amazon scraping failed: {e}")
            return []

    def _build_search_url(self, query: SearchQuery) -> str:
        base_url = "https://www.amazon.com/s"
        search_terms = []
        if query.brand:
            search_terms.append(query.brand)
        if query.model:
            search_terms.append(query.model)
        search_terms.extend(query.keywords)

        params = {
            'k': ' '.join(search_terms),
            'ref': 'sr_pg_1'
        }

        category_map = {
            'external hard drive': 'computers',
            'shaver': 'beauty'
        }
        if query.product_type.lower() in category_map:
            params['i'] = category_map[query.product_type.lower()]

        # urlencode escapes spaces and special characters safely
        return f"{base_url}?{urlencode(params)}"

    def _parse_search_results(self, html: str) -> List[Dict[str, Any]]:
        # Implementation would use BeautifulSoup or similar
        # to extract product information from HTML
        return []

    def get_site_name(self) -> str:
        return "Amazon"

class ScrapingOrchestrator:
    def __init__(self, scrapers: List[BaseScraper]):
        self.scrapers = scrapers

    async def search_all_sites(self, query: SearchQuery) -> Dict[str, List[Dict[str, Any]]]:
        async with aiohttp.ClientSession() as session:
            for scraper in self.scrapers:
                scraper.session = session

            tasks = [
                scraper.search_products(query)
                for scraper in self.scrapers
            ]
            results = await asyncio.gather(*tasks, return_exceptions=True)

            site_results = {}
            for scraper, result in zip(self.scrapers, results):
                if isinstance(result, Exception):
                    logging.error(f"Scraper {scraper.get_site_name()} failed: {result}")
                    site_results[scraper.get_site_name()] = []
                else:
                    site_results[scraper.get_site_name()] = result
            return site_results
This orchestrator design demonstrates several important architectural principles for building scalable scraping systems. The BaseScraper abstract class defines a common interface that all site-specific scrapers must implement, ensuring consistency while allowing for site-specific optimizations. This abstraction enables the system to add new merchant sites without modifying existing code, simply by implementing the BaseScraper interface for the new site.
The use of asyncio and aiohttp enables concurrent scraping across multiple sites, dramatically reducing total search time compared to sequential approaches. In a typical scenario where scraping five different merchant sites might take 10-15 seconds sequentially, the concurrent approach can complete the same task in 2-3 seconds, significantly improving user experience.
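Unbounded concurrency has its own risks: firing every scraper at once can exhaust local connections or trip per-IP rate limits. A common refinement, sketched here under the assumption that scrapers are plain awaitables, is to cap in-flight tasks with a semaphore:

```python
import asyncio

async def gather_with_limit(tasks, max_concurrent: int = 3):
    """Run awaitables concurrently, but cap how many are in flight
    at once so a burst of scrapers doesn't overwhelm local resources
    or trip per-IP rate limits on merchant sites."""
    semaphore = asyncio.Semaphore(max_concurrent)

    async def bounded(task):
        async with semaphore:
            return await task

    # return_exceptions=True keeps one failure from cancelling the rest
    return await asyncio.gather(*(bounded(t) for t in tasks),
                                return_exceptions=True)
```

Swapping this in for the bare `asyncio.gather` call in the orchestrator keeps the latency benefit of concurrency while bounding the load placed on any one site.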
The AmazonScraper example shows how individual scrapers can leverage the structured query information to build appropriate search URLs for their target sites. The _build_search_url method demonstrates how different query components get mapped to site-specific parameters, while the category mapping shows how product types can be translated into site-specific filters that improve search accuracy.
The error handling in the orchestrator ensures that failures on individual sites do not compromise the entire search operation. This resilience is crucial in production systems where merchant sites may be temporarily unavailable, may have implemented new anti-scraping measures, or may have changed their page structures. The system gracefully degrades by continuing to provide results from successful scrapers while logging errors for later analysis.
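A related failure mode is a site that neither succeeds nor errors but simply hangs. Wrapping each scraper call in a timeout turns a stalled merchant into an empty result; the helper below is a minimal sketch of that pattern (the `with_timeout` name and default of 8 seconds are illustrative choices, not from the original design):

```python
import asyncio

async def with_timeout(coro, seconds: float = 8.0, fallback=None):
    """Bound how long any single scraper may run; a hung merchant
    site then degrades to an empty result instead of stalling the
    whole search."""
    try:
        return await asyncio.wait_for(coro, timeout=seconds)
    except asyncio.TimeoutError:
        return [] if fallback is None else fallback
```

Applied inside the orchestrator, `with_timeout(scraper.search_products(query))` guarantees an upper bound on total search latency regardless of how any one site behaves.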
Data Extraction and Normalization Using LLMs
Once raw HTML content has been retrieved from merchant sites, the next challenge involves extracting structured product information from the unstructured markup. This process traditionally relies on XPath selectors or CSS selectors that are brittle and require constant maintenance as sites change their layouts. LLMs offer a more flexible approach by understanding the semantic content of product pages rather than relying on specific markup patterns.
The following code example demonstrates how an LLM can extract product information from raw HTML:
import json
import logging
import re
from bs4 import BeautifulSoup
from dataclasses import dataclass
from typing import Optional, List, Dict

@dataclass
class ProductInfo:
    title: str
    price: float
    currency: str
    availability: str
    rating: Optional[float]
    review_count: Optional[int]
    specifications: Dict[str, str]
    image_url: Optional[str]
    product_url: str

class LLMDataExtractor:
    def __init__(self, llm_client):
        self.llm_client = llm_client

    def extract_product_info(self, html_content: str, site_name: str,
                             original_query: SearchQuery) -> List[ProductInfo]:
        soup = BeautifulSoup(html_content, 'html.parser')
        # Drop non-content elements before handing text to the LLM
        for tag in soup(['script', 'style', 'nav', 'footer', 'header']):
            tag.decompose()

        cleaned_text = self._clean_html_text(soup.get_text())
        product_sections = self._identify_product_sections(cleaned_text)

        products = []
        for section in product_sections:
            try:
                product_info = self._extract_single_product(
                    section, site_name, original_query
                )
                if product_info:
                    products.append(product_info)
            except Exception as e:
                logging.warning(f"Failed to extract product from section: {e}")
                continue
        return products

    def _extract_single_product(self, product_text: str, site_name: str,
                                query: SearchQuery) -> Optional[ProductInfo]:
        prompt = f"""
        Extract product information from this {site_name} product listing:

        Original search query context:
        - Product type: {query.product_type}
        - Brand: {query.brand or 'any'}
        - Keywords: {', '.join(query.keywords)}

        Product listing text:
        {product_text[:2000]}

        Extract the following information and return as JSON:
        {{
            "title": "full product title",
            "price": numeric_price_value,
            "currency": "USD/EUR/etc",
            "availability": "in stock/out of stock/limited",
            "rating": numeric_rating_or_null,
            "review_count": numeric_count_or_null,
            "specifications": {{"key": "value"}},
            "is_relevant": true/false
        }}

        Set is_relevant to false if this product doesn't match the search intent.
        For specifications, extract technical details like capacity, dimensions, model numbers.
        """
        try:
            response = self.llm_client.chat.completions.create(
                model="gpt-4",
                messages=[{"role": "user", "content": prompt}],
                temperature=0.1
            )
            extracted_data = json.loads(response.choices[0].message.content)
            if not extracted_data.get("is_relevant", True):
                return None
            return ProductInfo(
                title=extracted_data["title"],
                price=float(extracted_data["price"]),
                currency=extracted_data["currency"],
                availability=extracted_data["availability"],
                rating=extracted_data.get("rating"),
                review_count=extracted_data.get("review_count"),
                specifications=extracted_data.get("specifications", {}),
                image_url=None,
                product_url=""
            )
        except (json.JSONDecodeError, ValueError, KeyError) as e:
            logging.error(f"Failed to parse LLM response: {e}")
            return None

    def _clean_html_text(self, raw_text: str) -> str:
        # Collapse runs of spaces/tabs but preserve newlines, which
        # _identify_product_sections relies on to split listings
        cleaned = re.sub(r'[ \t]+', ' ', raw_text)
        cleaned = re.sub(r'\n\s*\n', '\n', cleaned)
        return cleaned.strip()

    def _identify_product_sections(self, text: str) -> List[str]:
        sections = []
        current_section = []
        for line in text.split('\n'):
            if self._looks_like_product_title(line):
                if current_section:
                    sections.append('\n'.join(current_section))
                current_section = [line]
            else:
                current_section.append(line)
        if current_section:
            sections.append('\n'.join(current_section))
        return sections

    def _looks_like_product_title(self, line: str) -> bool:
        if len(line) < 10 or len(line) > 200:
            return False
        product_indicators = ['GB', 'TB', 'inch', 'MHz', 'GHz', 'Model', 'Series']
        return any(indicator in line for indicator in product_indicators)
This extraction system demonstrates how LLMs can handle the variability and complexity of e-commerce product pages much more effectively than traditional rule-based approaches. The _extract_single_product method uses a carefully constructed prompt that provides context about the original search query, helping the LLM determine product relevance and extract appropriate specifications.
The prompt design includes several important elements that improve extraction accuracy. First, it provides context about the original search query, allowing the LLM to better assess whether extracted products are relevant to the user’s intent. This context is crucial because product pages often contain multiple items or related products that might not match the user’s specific search. Second, it requests a specific JSON format, making the response easy to parse programmatically while ensuring consistent data structure across different extractions.
The inclusion of an is_relevant field serves as a quality filter, helping eliminate products that might appear on the page but don’t match the search criteria. This is particularly important on sites like Amazon where search results pages often include sponsored products or related items that aren’t direct matches for the user’s query.
The text preprocessing steps are crucial for effective LLM processing. The _clean_html_text method removes excessive whitespace and normalizes formatting, which helps the LLM focus on actual content rather than formatting artifacts. The _identify_product_sections method attempts to split large pages into individual product listings, which allows the system to process each product separately and leads to more accurate extraction results.
Product Matching and Comparison Logic
After extracting product information from multiple sources, the system must determine which products are equivalent across different merchants. This matching process goes beyond simple string comparison and requires understanding of product variations, alternative naming conventions, and specification equivalencies. The challenge is particularly complex because the same product might be listed with completely different titles and descriptions across different sites.
import json
import logging
import re
from difflib import SequenceMatcher
from typing import Dict, List, Optional

class ProductMatcher:
    def __init__(self, llm_client):
        self.llm_client = llm_client

    def find_matching_products(self, products_by_site: Dict[str, List[ProductInfo]]) -> List[Dict]:
        all_products = []
        for site, products in products_by_site.items():
            for product in products:
                # Attach the source site dynamically for later reporting
                product.source_site = site
                all_products.append(product)

        clusters = self._cluster_similar_products(all_products)

        comparisons = []
        for cluster in clusters:
            if len(cluster) > 1:
                comparison = self._create_price_comparison(cluster)
                if comparison:
                    comparisons.append(comparison)
        return sorted(comparisons, key=lambda x: x['min_price'])

    def _cluster_similar_products(self, products: List[ProductInfo]) -> List[List[ProductInfo]]:
        clusters = []
        used_products = set()
        for i, product in enumerate(products):
            if i in used_products:
                continue
            cluster = [product]
            used_products.add(i)
            for j, other_product in enumerate(products[i+1:], i+1):
                if j in used_products:
                    continue
                if self._are_products_equivalent(product, other_product):
                    cluster.append(other_product)
                    used_products.add(j)
            clusters.append(cluster)
        return clusters

    def _are_products_equivalent(self, product1: ProductInfo, product2: ProductInfo) -> bool:
        if self._quick_mismatch_check(product1, product2):
            return False
        return self._llm_product_comparison(product1, product2)

    def _quick_mismatch_check(self, product1: ProductInfo, product2: ProductInfo) -> bool:
        lower_price = min(product1.price, product2.price)
        if lower_price > 0:
            price_ratio = max(product1.price, product2.price) / lower_price
            if price_ratio > 3.0:
                return True
        for key, value1 in product1.specifications.items():
            if key in product2.specifications:
                value2 = product2.specifications[key]
                if self._specifications_conflict(key, value1, value2):
                    return True
        return False

    def _specifications_conflict(self, spec_name: str, value1: str, value2: str) -> bool:
        val1_norm = self._normalize_spec_value(spec_name, value1)
        val2_norm = self._normalize_spec_value(spec_name, value2)
        if 'capacity' in spec_name.lower() or 'storage' in spec_name.lower():
            try:
                num1, unit1 = self._extract_capacity(val1_norm)
                num2, unit2 = self._extract_capacity(val2_norm)
                if num1 and num2 and unit1 == unit2:
                    ratio = max(num1, num2) / min(num1, num2)
                    return ratio > 1.1
            except (ValueError, ZeroDivisionError):
                pass
        similarity = SequenceMatcher(None, val1_norm, val2_norm).ratio()
        return similarity < 0.8

    def _normalize_spec_value(self, spec_name: str, value: str) -> str:
        # Minimal normalization; a production system would also map
        # unit synonyms and strip marketing qualifiers
        return value.lower().strip()

    def _extract_capacity(self, value: str):
        # Minimal stand-in: pull a number plus a storage unit from text
        match = re.search(r'(\d+(?:\.\d+)?)\s*(tb|gb|mb)', value)
        if match:
            return float(match.group(1)), match.group(2)
        return None, None

    def _llm_product_comparison(self, product1: ProductInfo, product2: ProductInfo) -> bool:
        prompt = f"""
        Compare these two products and determine if they are the same item:

        Product 1 (from {product1.source_site}):
        Title: {product1.title}
        Price: {product1.price} {product1.currency}
        Specifications: {json.dumps(product1.specifications)}

        Product 2 (from {product2.source_site}):
        Title: {product2.title}
        Price: {product2.price} {product2.currency}
        Specifications: {json.dumps(product2.specifications)}

        Consider:
        - Are these the same product model?
        - Do the specifications match or are they compatible?
        - Could price differences be explained by different sellers/conditions?

        Answer with only "YES" if they are the same product, or "NO" if they are different products.
        """
        try:
            response = self.llm_client.chat.completions.create(
                model="gpt-4",
                messages=[{"role": "user", "content": prompt}],
                temperature=0.1,
                max_tokens=10
            )
            answer = response.choices[0].message.content.strip().upper()
            return answer == "YES"
        except Exception as e:
            logging.error(f"LLM comparison failed: {e}")
            # Fall back to fuzzy title matching when the LLM is unavailable
            title_similarity = SequenceMatcher(None, product1.title, product2.title).ratio()
            return title_similarity > 0.7

    def _create_price_comparison(self, cluster: List[ProductInfo]) -> Optional[Dict]:
        if not cluster:
            return None
        cluster.sort(key=lambda p: p.price)
        min_price = cluster[0].price
        max_price = cluster[-1].price
        savings = max_price - min_price
        savings_percent = (savings / max_price) * 100 if max_price > 0 else 0
        return {
            'product_title': cluster[0].title,
            'min_price': min_price,
            'max_price': max_price,
            'savings': savings,
            'savings_percent': round(savings_percent, 1),
            'offers': [
                {
                    'site': product.source_site,
                    'price': product.price,
                    'currency': product.currency,
                    'availability': product.availability,
                    'rating': product.rating,
                    'review_count': product.review_count,
                    'title': product.title
                }
                for product in cluster
            ]
        }
This matching system combines multiple approaches to handle the complexity of product comparison across different merchant sites. The system first applies quick heuristic checks to eliminate obvious non-matches before using the more expensive LLM comparison for borderline cases. This hybrid approach balances accuracy with computational efficiency.
The quick mismatch checks provide fast filtering to eliminate obvious non-matches based on significant price differences or conflicting specifications. The price ratio check prevents matching products whose prices differ by more than a factor of three, which typically indicates different product categories or significantly different product tiers. However, the system must be careful not to make this threshold too restrictive, as legitimate price differences between retailers can be substantial.
The _specifications_conflict method demonstrates how the system can handle technical specifications intelligently. For capacity-based specifications like storage, it allows for reasonable manufacturing tolerances while flagging significant differences that would indicate different products. This approach prevents false matches between products with similar names but different technical specifications, such as matching a 1TB external drive with a 4TB model.
The LLM comparison prompt is designed to be decisive and focused, requesting only a YES or NO answer to avoid ambiguous responses that would be difficult to process programmatically. The prompt provides comprehensive context about both products and explicitly asks the model to consider factors like price differences that might be explained by different sellers or conditions rather than indicating different products. The max_tokens limitation ensures quick responses and prevents the LLM from providing lengthy explanations that aren’t needed for this binary decision.
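Because clustering performs up to O(n²) pairwise checks, the same pair of listings can be evaluated repeatedly across searches. Memoizing verdicts keyed on an order-independent pair of normalized titles avoids paying for the same LLM comparison twice. The class below is an illustrative sketch of such a cache, not part of the original matcher:

```python
class ComparisonCache:
    """Memoize pairwise equivalence verdicts so symmetric pairs and
    repeated clustering runs don't trigger duplicate LLM calls."""

    def __init__(self):
        self._verdicts = {}

    @staticmethod
    def _key(title1: str, title2: str):
        # Normalize case/whitespace, then sort so (a, b) == (b, a)
        norm = lambda t: " ".join(t.lower().split())
        return tuple(sorted((norm(title1), norm(title2))))

    def get(self, title1, title2):
        return self._verdicts.get(self._key(title1, title2))

    def put(self, title1, title2, verdict: bool):
        self._verdicts[self._key(title1, title2)] = verdict
```

In `_are_products_equivalent`, a cache lookup before the LLM call and a `put` after it reduces API spend roughly in proportion to how often the same listings recur.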
Result Processing and Presentation
The final component of the system involves presenting the comparison results in a format that helps users make informed purchasing decisions. This presentation layer can leverage the LLM to generate natural language explanations of price differences, highlight key product features, and provide personalized recommendations based on the user’s original query.
import logging
from datetime import datetime
from typing import List, Dict, Any

class ResultPresenter:
    def __init__(self, llm_client):
        self.llm_client = llm_client

    def format_search_results(self, comparisons: List[Dict],
                              original_query: SearchQuery) -> Dict[str, Any]:
        ranked_comparisons = self._rank_comparisons(comparisons, original_query)
        summary = self._generate_search_summary(ranked_comparisons, original_query)

        formatted_results = []
        for comparison in ranked_comparisons[:10]:
            formatted_result = self._format_single_comparison(comparison, original_query)
            formatted_results.append(formatted_result)

        return {
            'query': {
                'original_input': original_query.keywords,
                'processed_query': {
                    'product_type': original_query.product_type,
                    'brand': original_query.brand,
                    'specifications': original_query.specifications
                }
            },
            'summary': summary,
            'total_results': len(comparisons),
            'results': formatted_results,
            'search_timestamp': datetime.now().isoformat()
        }

    def _rank_comparisons(self, comparisons: List[Dict],
                          query: SearchQuery) -> List[Dict]:
        def calculate_score(comparison):
            score = 0
            savings_score = min(comparison['savings'] / 100, 50)
            score += savings_score
            percentage_score = min(comparison['savings_percent'] / 2, 15)
            score += percentage_score
            offer_count_score = min(len(comparison['offers']) * 2, 10)
            score += offer_count_score

            ratings = [offer.get('rating') for offer in comparison['offers']
                       if offer.get('rating')]
            if ratings:
                avg_rating = sum(ratings) / len(ratings)
                rating_score = (avg_rating - 3) * 2
                score += max(rating_score, 0)

            if query.price_range:
                min_budget, max_budget = query.price_range
                product_price = comparison['min_price']
                if product_price < min_budget or product_price > max_budget:
                    score -= 20
            return score

        return sorted(comparisons, key=calculate_score, reverse=True)

    def _generate_search_summary(self, comparisons: List[Dict],
                                 query: SearchQuery) -> str:
        if not comparisons:
            return "No matching products found across the searched retailers."

        top_comparisons = comparisons[:5]
        price_ranges = [f"${comp['min_price']:.2f} - ${comp['max_price']:.2f}"
                        for comp in top_comparisons]
        savings_data = [f"{comp['savings_percent']:.1f}%"
                        for comp in top_comparisons if comp['savings_percent'] > 5]

        prompt = f"""
        Create a brief summary for a price comparison search with these results:

        Original search: {query.product_type} {query.brand or ''} {' '.join(query.keywords)}
        Found {len(comparisons)} matching products across multiple retailers.
        Top 5 results price ranges: {', '.join(price_ranges)}
        Significant savings opportunities: {', '.join(savings_data) if savings_data else 'Limited'}

        Write a 2-3 sentence summary highlighting:
        1. What was found
        2. Best savings opportunity (if any)
        3. Key recommendation for the user

        Keep it concise and helpful for someone comparing prices on {query.product_type}.
        """
        try:
            response = self.llm_client.chat.completions.create(
                model="gpt-4",
                messages=[{"role": "user", "content": prompt}],
                temperature=0.3,
                max_tokens=150
            )
            return response.choices[0].message.content.strip()
        except Exception as e:
            logging.error(f"Summary generation failed: {e}")
            best_savings = max((comp['savings_percent'] for comp in comparisons), default=0)
            return f"Found {len(comparisons)} matching products with savings up to {best_savings:.1f}%."

    def _format_single_comparison(self, comparison: Dict,
                                  query: SearchQuery) -> Dict[str, Any]:
        offers = comparison['offers']
        best_offer = offers[0]
        explanation = self._explain_price_differences(comparison, query)

        return {
            'product': {
                'title': comparison['product_title'],
                'category': query.product_type
            },
            'price_analysis': {
                'best_price': best_offer['price'],
                'highest_price': offers[-1]['price'],
                'currency': best_offer['currency'],
                'savings_amount': comparison['savings'],
                'savings_percent': comparison['savings_percent'],
                'explanation': explanation
            },
            'offers': [
                {
                    'retailer': offer['site'],
                    'price': offer['price'],
                    'availability': offer['availability'],
                    'rating': offer.get('rating'),
                    'review_count': offer.get('review_count'),
                    'is_best_price': offer == best_offer
                }
                for offer in offers
            ],
            'recommendation': self._generate_purchase_recommendation(comparison, query)
        }

    def _explain_price_differences(self, comparison: Dict, query: SearchQuery) -> str:
        offers = comparison['offers']
        if len(offers) < 2:
            return "Single retailer found for this product."
        price_spread = comparison['max_price'] - comparison['min_price']
        if price_spread < 5:
            return "Prices are very similar across retailers."
        elif comparison['savings_percent'] > 20:
            return f"Significant price variation found - save {comparison['savings_percent']:.1f}% by choosing the best offer."
        else:
            return f"Moderate price differences - potential savings of ${price_spread:.2f}."

    def _generate_purchase_recommendation(self, comparison: Dict, query: SearchQuery) -> str:
        best_offer = comparison['offers'][0]
        # Consider factors beyond just price
        offers_with_ratings = [o for o in comparison['offers'] if o.get('rating')]
        if offers_with_ratings:
            best_rated = max(offers_with_ratings, key=lambda x: x['rating'])
            if best_rated != best_offer and best_rated['rating'] > 4.5:
                price_diff = best_rated['price'] - best_offer['price']
                if price_diff < comparison['savings'] * 0.3:  # Within 30% of max savings
                    return f"Consider {best_rated['site']} for better ratings despite ${price_diff:.2f} higher price."
        return f"Best value at {best_offer['site']} - lowest price with {best_offer['availability']}."
# Main orchestration class that ties everything together
class PriceComparisonEngine:
    def __init__(self, llm_client, scrapers: List[BaseScraper]):
        self.query_processor = QueryProcessor(llm_client)
        self.scraping_orchestrator = ScrapingOrchestrator(scrapers)
        self.data_extractor = LLMDataExtractor(llm_client)
        self.product_matcher = ProductMatcher(llm_client)
        self.result_presenter = ResultPresenter(llm_client)

    async def search_products(self, user_query: str) -> Dict[str, Any]:
        # Process the user's natural language query
        structured_query = self.query_processor.process_query(user_query)

        # Scrape multiple sites concurrently; in this design each scraper
        # returns raw HTML fragments for the LLM extractor to process
        raw_results = await self.scraping_orchestrator.search_all_sites(structured_query)

        # Extract structured data from each site's results
        extracted_products = {}
        for site, html_results in raw_results.items():
            site_products = []
            for html_content in html_results:
                products = self.data_extractor.extract_product_info(
                    html_content, site, structured_query
                )
                site_products.extend(products)
            extracted_products[site] = site_products

        # Match equivalent products across sites
        comparisons = self.product_matcher.find_matching_products(extracted_products)

        # Format and present results
        formatted_results = self.result_presenter.format_search_results(
            comparisons, structured_query
        )
        return formatted_results

# Example usage demonstration
async def example_usage():
    # Initialize LLM client (example with OpenAI)
    llm_client = openai.OpenAI(api_key="your-api-key")

    # Set up scrapers for different sites; the orchestrator injects a
    # shared session at search time, so None is acceptable here
    scrapers = [
        AmazonScraper(None),
        # Additional scrapers would be added here
        # BestBuyScraper(None),
        # NeweggScraper(None),
    ]

    # Create the main engine
    engine = PriceComparisonEngine(llm_client, scrapers)

    # Perform a search
    results = await engine.search_products("Braun Series 9 electric shaver")

    # Display results
    print(f"Search Summary: {results['summary']}")
    print(f"Found {len(results['results'])} product comparisons")

    for result in results['results'][:3]:  # Show top 3 results
        product = result['product']
        analysis = result['price_analysis']
        print(f"\nProduct: {product['title']}")
        print(f"Price range: ${analysis['best_price']:.2f} - ${analysis['highest_price']:.2f}")
        print(f"Potential savings: {analysis['savings_percent']:.1f}%")
        print(f"Recommendation: {result['recommendation']}")
Implementation Challenges and Solutions
Building an LLM-powered price comparison system presents several unique challenges that require careful consideration and robust solutions. The most significant challenge involves managing the cost and latency of LLM API calls. Each search query potentially requires multiple LLM interactions for query processing, data extraction, product matching, and result presentation. With modern LLM APIs charging per token, costs can accumulate quickly, especially for high-traffic applications.
To address cost concerns, the system should implement intelligent caching strategies. Query processing results can be cached for common search terms, since the structured representation of “Braun shaver” will be consistent across users. Similarly, product extraction results can be cached with short expiration times, since product information on merchant sites changes relatively slowly compared to the frequency of user queries.
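A minimal sketch of such a cache, assuming queries are keyed by a normalized hash of the raw search text (the `fake_process_query` stub stands in for the real LLM-backed query processor, which is not shown here):

```python
import hashlib
import time
from typing import Any, Callable, Dict, Tuple

class TTLCache:
    """Caches computed values for a fixed time-to-live, keyed by input hash."""

    def __init__(self, ttl_seconds: float):
        self.ttl = ttl_seconds
        self._store: Dict[str, Tuple[float, Any]] = {}

    def _key(self, raw: str) -> str:
        # Normalize so "Braun shaver" and "braun  SHAVER" share one entry
        normalized = " ".join(raw.lower().split())
        return hashlib.sha256(normalized.encode()).hexdigest()

    def get_or_compute(self, raw: str, compute: Callable[[str], Any]) -> Any:
        key = self._key(raw)
        entry = self._store.get(key)
        if entry is not None and time.monotonic() - entry[0] < self.ttl:
            return entry[1]  # cache hit: no LLM call, no token cost
        value = compute(raw)
        self._store[key] = (time.monotonic(), value)
        return value

# Usage: wrap the expensive LLM query-processing step (stubbed here)
calls = []
def fake_process_query(q):
    calls.append(q)  # tracks how often the "LLM" is actually invoked
    return {"brand": "Braun", "category": "shaver"}

cache = TTLCache(ttl_seconds=3600)
a = cache.get_or_compute("Braun shaver", fake_process_query)
b = cache.get_or_compute("braun  SHAVER", fake_process_query)  # served from cache
```

Because the key is built from normalized text, trivially different spellings of the same query reuse one cached result.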
Rate limiting presents another significant challenge, both from LLM providers and from target merchant websites. The system must implement sophisticated backoff strategies that respect API rate limits while maintaining acceptable response times. For merchant sites, the system should employ techniques like rotating user agents, implementing delays between requests, and using proxy rotation when necessary.
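One common backoff shape is exponential delay with random jitter, sketched below; the retry wrapper and the `flaky` function are illustrative, not part of the system above:

```python
import random
import time

def with_backoff(func, max_retries=5, base_delay=0.5, max_delay=30.0,
                 retryable=(TimeoutError,), sleep=time.sleep):
    """Retry func on retryable errors, doubling the delay cap each attempt."""
    for attempt in range(max_retries):
        try:
            return func()
        except retryable:
            if attempt == max_retries - 1:
                raise  # out of retries: surface the error to the caller
            # Exponential growth capped at max_delay; jitter avoids many
            # workers retrying in lockstep against the same rate limit
            delay = min(max_delay, base_delay * (2 ** attempt))
            sleep(random.uniform(0, delay))

# Usage sketch: a call that is rate-limited twice before succeeding
attempts = {"n": 0}
def flaky():
    attempts["n"] += 1
    if attempts["n"] < 3:
        raise TimeoutError("rate limited")
    return "ok"

result = with_backoff(flaky, sleep=lambda s: None)  # skip real sleeps in the demo
```

The injectable `sleep` parameter keeps the wrapper testable; production code would use the default `time.sleep` (or an async equivalent).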
Data quality and consistency represent ongoing challenges in web scraping applications. Merchant sites frequently change their layouts, implement new anti-bot measures, or modify their product information formats. The system should include monitoring mechanisms that detect when scrapers begin returning unexpected results, triggering alerts for manual review and scraper updates.
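A simple version of such monitoring might track a rolling window of scraper runs and flag a site when too many recent runs return zero products, which often signals a changed layout or an anti-bot block. The class below is a sketch under those assumptions:

```python
from collections import deque

class ScraperHealthMonitor:
    """Flags a scraper when too many recent runs return no products."""

    def __init__(self, window=10, failure_threshold=0.5):
        self.window = window
        self.failure_threshold = failure_threshold
        self._history = {}  # site -> deque of booleans (True = empty run)

    def record(self, site: str, product_count: int) -> None:
        history = self._history.setdefault(site, deque(maxlen=self.window))
        history.append(product_count == 0)

    def needs_review(self, site: str) -> bool:
        history = self._history.get(site)
        if not history or len(history) < self.window:
            return False  # not enough data to judge yet
        return sum(history) / len(history) >= self.failure_threshold

# Usage: four empty runs out of the last five trips the alert
monitor = ScraperHealthMonitor(window=5, failure_threshold=0.6)
for count in [0, 0, 0, 12, 0]:
    monitor.record("amazon", count)
```

Real deployments would likely also watch for sudden drops in field completeness (missing prices, missing titles), not just empty result sets.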
Another significant challenge involves handling the non-deterministic nature of LLM responses. While using low temperature settings helps ensure consistency, LLMs can still occasionally produce unexpected outputs or fail to follow the requested format. The system must include robust validation and fallback mechanisms that can handle malformed responses gracefully without causing system failures.
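A validation layer along these lines can catch the most common failure modes: responses wrapped in markdown fences, responses that are not JSON at all, and JSON missing required fields (the key names here are illustrative):

```python
import json

def parse_llm_json(raw: str, required_keys=("title", "price"), fallback=None):
    """Parse an LLM response expected to be a JSON object.
    Returns the dict, or `fallback` if the response is malformed."""
    text = raw.strip()
    # LLMs often wrap JSON in markdown code fences; strip them first
    if text.startswith("```"):
        text = text.strip("`")
        if text.startswith("json"):
            text = text[4:]
    try:
        data = json.loads(text)
    except json.JSONDecodeError:
        return fallback  # not JSON at all: degrade gracefully
    if not isinstance(data, dict) or not all(k in data for k in required_keys):
        return fallback  # JSON, but not the shape we asked for
    return data

good = parse_llm_json('{"title": "Series 9 Pro", "price": 249.99}')
fenced = parse_llm_json('```json\n{"title": "Series 9 Pro", "price": 249.99}\n```')
bad = parse_llm_json("Sorry, I cannot help with that.", fallback={})
```

When the fallback is hit, the caller can retry the LLM call (ideally with the backoff logic described earlier) or simply omit that product from the results.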
Performance Considerations
The performance characteristics of an LLM-powered price comparison system differ significantly from traditional search systems due to the computational overhead of language model processing and the latency of external API calls. Understanding these performance implications is crucial for designing a system that provides acceptable user experience while managing operational costs effectively.
Latency optimization requires careful orchestration of the various system components. The scraping operations should be parallelized across different merchant sites to minimize total scraping time. However, the LLM processing steps often need to be sequential, as each step depends on the results of the previous step. The system can optimize this by batching multiple LLM requests where possible, such as processing multiple product extractions in a single API call.
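Batching might look like the sketch below: several product snippets go into one prompt, and the response is expected to be a JSON array in the same order. The `stub_llm` function stands in for a real API client so the example runs offline; the prompt wording and key names are assumptions:

```python
import json
from typing import Callable, List

def extract_batch(snippets: List[str], llm_call: Callable[[str], str]) -> List[dict]:
    """Send several product snippets in ONE prompt and expect one JSON
    array back -- one object per snippet, far fewer API round trips."""
    numbered = "\n".join(f"{i + 1}. {s}" for i, s in enumerate(snippets))
    prompt = (
        "For each numbered product below, return a JSON array of objects "
        'with keys "title" and "price", in the same order:\n' + numbered
    )
    raw = llm_call(prompt)
    parsed = json.loads(raw)
    if not isinstance(parsed, list) or len(parsed) != len(snippets):
        raise ValueError("batched response did not match input count")
    return parsed

# Stubbed LLM so the sketch is self-contained; a real client would go here
def stub_llm(prompt: str) -> str:
    return json.dumps([
        {"title": "Braun Series 9 Pro", "price": 249.99},
        {"title": "Braun Series 8", "price": 179.99},
    ])

products = extract_batch(
    ["Braun Series 9 Pro Electric Shaver...", "Braun Series 8 Razor..."],
    stub_llm,
)
```

The length check matters: batched prompts occasionally cause the model to drop or merge items, and silently misaligned results are worse than a retried call.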
Caching strategies become crucial for managing both performance and costs. The system should implement multi-level caching, starting with query processing results that can be reused across similar searches. Product information extracted from merchant sites can be cached for short periods, typically 15-30 minutes, to balance freshness with performance. Price information requires more frequent updates, but even brief caching can significantly reduce the load on merchant sites.
The system should also implement progressive result presentation, showing initial results as they become available rather than waiting for all scrapers to complete. This approach significantly improves perceived performance, as users can begin evaluating options while additional sites are still being processed.
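With asyncio, progressive presentation falls out of `asyncio.as_completed`, which yields each scraper task as it finishes rather than in submission order. A self-contained sketch (with sleeps simulating network latency in place of real scrapers):

```python
import asyncio

async def scrape_site(site: str, delay: float):
    # Stand-in for a real scraper; delay simulates network latency
    await asyncio.sleep(delay)
    return site, [f"{site}-product"]

async def progressive_search(sites):
    """Handle each site's results as soon as that scraper finishes,
    instead of waiting for the slowest site."""
    tasks = [asyncio.create_task(scrape_site(site, delay))
             for site, delay in sites]
    arrival_order = []
    for finished in asyncio.as_completed(tasks):
        site, products = await finished
        arrival_order.append(site)  # a real UI would render this batch now
    return arrival_order

order = asyncio.run(progressive_search(
    [("slow-site", 0.05), ("fast-site", 0.01)]
))
```

Here the fast site's results are available and displayable well before the slow site responds, which is exactly the perceived-performance win described above.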
Error handling and resilience are particularly important in price comparison systems, where temporary failures of individual components should not prevent the system from providing useful results. The system should gracefully degrade when individual scrapers fail, continuing to provide comparisons from successful scrapers while logging failures for later analysis.
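One idiomatic way to get this behavior in asyncio is `gather(..., return_exceptions=True)`, which collects failures as values instead of aborting the whole batch. The sketch below simulates one blocked scraper among three:

```python
import asyncio

async def scrape(site: str):
    if site == "bestbuy":
        raise ConnectionError(f"{site}: blocked")  # simulated scraper failure
    await asyncio.sleep(0)
    return site, [f"{site}-offer"]

async def search_all(sites):
    """One failing scraper must not sink the whole search: exceptions are
    collected, logged for later analysis, and the rest of the results kept."""
    outcomes = await asyncio.gather(
        *(scrape(s) for s in sites), return_exceptions=True
    )
    results, failures = {}, []
    for site, outcome in zip(sites, outcomes):
        if isinstance(outcome, Exception):
            failures.append((site, repr(outcome)))  # degrade, don't crash
        else:
            results[outcome[0]] = outcome[1]
    return results, failures

results, failures = asyncio.run(search_all(["amazon", "bestbuy", "newegg"]))
```

The user still sees comparisons from the two working sites, while the failure record feeds the scraper-health monitoring discussed earlier.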
Future Enhancements and Conclusion
The integration of LLMs into price comparison systems represents a significant advancement in handling the complexity and variability of e-commerce data. However, several areas present opportunities for future enhancement and optimization.
Machine learning models could be trained to improve product matching accuracy over time, learning from user feedback about whether suggested matches are correct. This approach would combine the flexibility of LLMs with the efficiency of specialized models trained on specific matching tasks.
Real-time price tracking capabilities could be added to monitor price changes for products that users have shown interest in, providing alerts when prices drop or when new retailers begin offering competitive prices. This feature would require more sophisticated data storage and background processing capabilities.
The system could be enhanced with inventory tracking, providing users with information about product availability and estimated delivery times in addition to price comparisons. This enhancement would require integration with retailer APIs where available, or enhanced scraping to extract availability information.
Integration with coupon and promotional code databases could provide users with additional savings opportunities beyond simple price comparisons. This feature would require maintaining databases of current promotional offers and applying them to price calculations.
The current implementation demonstrates how LLMs can transform traditional price comparison systems by handling the semantic complexity of product matching and providing intelligent user interfaces. While challenges exist around cost, latency, and reliability, the benefits in terms of accuracy and user experience make LLM integration a compelling approach for modern price comparison applications.
The system architecture presented here provides a solid foundation for building production-ready price comparison services. The modular design allows for incremental improvements and additions without requiring complete system redesigns. As LLM technologies continue to evolve and become more cost-effective, we can expect to see even more sophisticated applications of these techniques in e-commerce and price comparison systems.
The key to successful implementation lies in balancing the power of LLMs with practical considerations around cost, performance, and reliability. By carefully designing the system architecture and implementing appropriate caching, error handling, and optimization strategies, developers can create price comparison systems that provide significant value to users while maintaining sustainable operational characteristics.