script to scrape brand names from listing pages of amazon under different categories
- parse.py : scrape brand names from hierarchical category pages and store in dict format
- brands.py : extract salient tokens from textual data using occurrence and co-occurrence with other tokens. Combine partial tokens occurring in same text sample into one. Tag documents with extracted tokens