Sites using uClassify

January 24th, 2010

I’ve started to list sites that are using uClassify, I’ll update this list every now and then. Please comment if your site is missing.

UrlAi

UrlAi.com

Classifies blogs on gender, age, mood and tonality. Blogs are followed over time to give more accurate results.

TrollGuard

TrollGuard.com

TrollGuard is a free WordPress plugin that protects your blog from spam comments.

Typealyzer

Typealyzer.com

This innovative site finds out the blog author personality, using a psychological text analysis. Check it out!

GenderAnalyzer

GenderAnalyzer.com

This neat web site figures out if a blog is written by a man or woman, using the uclassify web service.

AgeAnalyzer

AgeAnalyzer.com

Tries to guess the author age from reading a blog.

oFaust

oFaust.com

See what classical author your text is most alike, perhaps it can be used to help write texts in e.g. Shakespearian style! It hightlights words and sentances that are characteristic for the author.

EpiSPIDER AI Mashup 2.0

EpiSPIDER is a tool that demonstrates connectivity between “consumers”, “producers” and “transformers” of data within an emerging information and knowledge architecture.

hombreomujer.com

Tries to guess an authors gender, for spanish texts.

fidelofranco.com

Are you writing like Fidel or like Franco? For texts in spanish.

tuideologia.com

Tries to guess left/right wing alignment for spanish texts.

Trve vs Emo

Are text more aligned aginst black metal bands, such as Mayhem, Burzum and Darkthrone or emo bands like My Chemical Romance and Fall out boy.

MDG Actors

Highlights the people that are making a big difference and call out the people that aren’t for reaching the Millennium Development Goals? Not sure if this site is still working.

BloggParti.se

Tests texts against the major political parties in Sweden. Only for Swedish texts.

UrlAi.com - who are you?

January 23rd, 2010

UrlAi

We have created a new service called UrlAi.com, the basic concept is to run blog posts through a bunch of classifiers over time. To begin with we use Gender, Age, Mood and Tonality but the system is dynamic so we can add new classifiers at any time. If you have created a classifier that would fit on urlai.com let us know!

Some ideas

We have many ideas of how we can develop this project further, for example, now we are only showing a summary pie chart, it would be nice to see posts over time. User feedback for online training and classifier improvement may be possible. Another thing we could do is to have classified posts searchable, for example, enabling users to see the mood of everyone who mentioned ‘Avatar’.

Some kudos

Just want to thank the people that has been involved in this project, Roger Karlsson for coding, Johanna Forsman for the awesome logo and Mattias Östmar for sharing his Tonality and Mood classifiers. Mattias has also contributed with many ideas around this, being the idea fountain he is :D

Artificial Intelligence to determine an authors age

November 25th, 2009

Young and old people

We have just released ageanalyzer.com, a site that reads a blog and guesses the age of the author!

Background

Our writing style reflects us in many ways, for example texts written in anger probably differs from words written in joy.  Reading a text intuitively gives us a clue about the author as you start forming a picture in your head.  Sometimes it’s easy to pinpoint how you got this picture and at other times harder.

We wanted to know if we could give computers the same intuition, in this particular project we are interesting in finding out if a computer can tell the age of an author - only given a text.

To do this experiment we collected 7000 blogs that had age information in the profile and split it into 6 different age groups, 13-17, 18-25, 26-35, 36-50, 51-65 and 65+. We then created a classifier on uClassify and fed it with the training data. Viola!

Expected results

After running tests on the training data (10-fold-cross-validation) it was clear that our classifier was able to find differences between the six age groups. We expect the proportion of correctly classified blogs would be around 30% compared to a baseline of 17% which would be expected if the classifier was guessing out of the blue.

We have added a poll to the site to help us see how well (or poorly) it works!

Try AgeAnalyzer out here!

Feedback, anyone?

November 5th, 2009

One of the most popular published classifiers is Language detection which classified more than 800000 texts last week. However, this is just the top of the iceberg as most classifiers are unpublished (about 500 classifiers). All classifiers are of course not active however a good number are, which brings me to my question. How does it work? I’m not getting much feedback or support - I am not sure if this is good or bad.

If you read this and are using uClassify for a project feel more than free to contact me with any positive or negative feedback on any aspect (classifier performance, documentation, response times or any unclarity). You may leave a comment or e-mail me at this address: contact at uclassify dot com. <– Are spambots able to read this now days?

Over and out!

Connection pool bug fix

November 1st, 2009

Some users have experienced

“Timeout expired.  The timeout period elapsed prior to obtaining a connection from the pool.  This may have occurred because all pooled connections were in use and max pool size was reached.”

This happened because some database connections persisted longer than expected and as a result the connection pool was drained. This became obvious with the release of the URL API. The bug should have been fixed now.

I’m also really happy to see that we are getting a lot new users since the introduction of the URL API!

URL classification API with JSON responses

October 25th, 2009

Many people have contacted me asking for JSON support. I’ve not used it myself earlier but it seems like a widely used format, so I decided to implement support in our new URL API.

Just add the the argument output=json on the end of a classification URL request.

Please let me know if there is anything that looks weird in the JSON API or if something should be redesigned.

Beer…. is the word more used by men or women?

http://uclassify.com/browse/uClassify/GenderAnalyzer_v5/ClassifyText?readkey=YourReadKeyHere&text=beer&output=json

If you click will be asked to save a file due to the MIME content type application/json, however this is how the response looks:

{
“version” : “1.00″,
“success” : true,
“statusCode” : 2000,
“errorMessage” : “”,
“cls1″ :
{
“female” : 0.15623,
“male” : 0.84377
}
}

New API for classification

October 21st, 2009

We are really happy to announce that we just have released an URL API to make it easier to access classifiers. This means that users can classify via an URL, for example finding out the language of the text snippet from Wiki:

“En apprentissage automatique, le terme de classifieur linéaire représente une famille d’algorithmes de classement statistique.”

http://www.uclassify.com/browse/uClassify/Text-Language/ClassifyText?readKey=YourReadKeyHere&text=En%20apprentissage%20automatique%2C%20le%20terme%20de%20classifieur%20lin%C3%A9aire%20repr%C3%A9sente%20une%20famille%20d’algorithmes%20de%20classement%20statistique.

You can also classify URLs directly. For example this call confirms that Rogers blog is written by a male:

http://www.uclassify.com/browse/uClassify/GenderAnalyzer_v5/ClassifyUrl?readKey=YourReadKeyHere&url=http%3A%2F%2Fwww.rogerkarlsson.com&removeHtml=1

The new API is documented here.

Faster site

October 10th, 2009

I’ve spent this Saturday to rewrite the logging system, as it turned out the old logs were not getting pruned and growing really big. I needed to refactor parts of the logging architecture in order to fix this. By doing so I decided to nuke all old logs instead of wasting time to convert them into the new tables. This is why the ‘calls’ count is reset on all classifiers. I did keep the total number of classifications though - and we have now passed 6 million of them!

As a result this site is much faster to browse, the biggest contrast can be seen when browsing the classifiers.

Keep the classifications going!!

Typealyzer is back

September 17th, 2009

Typealyzer has been down for quite some time. The reason has been trouble with the domain name, but the inventor, Mattias Östmar finally got it working with help from Annika Lidne. Good work!

Happy times!

Sorry for unstable server tonight

August 21st, 2009

I had to restart the Windows server tonight (for the first time in more 6 months), while doing this I decided to install some Windows updates. This took more time than expected (2 hours). Everything should be up and running now though, sorry for any inconvenience!