This is the NewsFinder software, designed to automatically crawl the web for news related to artificial intelligence, filter, categorize, and rank the news, and publish to a wiki, mailing list, and RSS feeds.
Create a branch of the software that puts a news story in several high-ranking categories. See if that helps the "appearance" of too many robot stories (a story labeled as "Vision, Robots" won't appear to be a robot story misclassified or dominating).
In retrospect, the obvious solution to working around NAT firewalls is to use a cookie with a unique identifier embedded in it. The server can automatically generate it when contacted by a client that doesn't provide one, and return it with the page markup for the NewsFinder page. Thereafter, the browser will supply the cookie back to the server, so you can track the activity of an unknown user.
This also solves the problem of users moving around the Internet and connecting to Newsfinder with different DHCP-assigned IP addresses.
Suppose somebody submits a news article via the website, and Bruce is emailed but opts not to upload the submission on the wiki. Does the AINews software respect this decision or does the software still process the submission regardless?
Instead of a centroid classifier, classifier articles in categories with SVMs; try multiclass (choose one category) and one-against-all (choose several good categories).
Email Tom & Liang about what's being done there (investigate first). Figure out a way to capture outbound clicks and plot it (put into Google Analytics?)
Like AIMag articles by Glick, in folder assets/; consider building an automated journalist that processes the AINews stories and summarizes them, for publication.
Make sure, in the code, that the AINews stuff is actually appearing, correctly. Also, what is the process for truncating based on time? Is it doing that?
cognitive science (or cognitive sciences … May I assume that stemming will obviate the need for singular and plural terms)
brain science
cognition
induction
causal reasoning
automatic translation
automatic translator
machine translator
translation system
machine translation … or even translation, given that we are already selecting sources
expert system
poker program
co-occurrence of “computer” and “brain” or “computer” and “brain” or “computer” and “recognition” or “computer” and “think”
computer vision
machine vision
pattern recognition
pattern analysis
image recognition
image analysis
voice recognition
speech recognition
avatar
computer music
computer art
neural network
semantic analysis
semantic search
recommendation system
recommendation engine
artificial thinking … question … does NewsFinder remove quotes prior to analysis? I ask because the phrase that appeared was: artificial 'thinking head'.