Package 'recommenderlabJester'

Title: Jester Dataset for 'recommenderlab'
Description: Provides the Jester Dataset for package recommenderlab.
Authors: Michael Hahsler [aut, cre, cph]
Maintainer: Michael Hahsler <[email protected]>
License: GPL-2
Version: 0.2-0
Built: 2024-09-03 02:34:39 UTC
Source: https://github.com/mhahsler/recommenderlabJester

Help Index


Jester Dataset

Description

The data set contains the anonymous ratings data from the Jester Online Joke Recommender System collected between April 1999 and May 2003.

Usage

data(Jester)

Format

The format is: Formal class 'realRatingMatrix' [package "recommenderlab"]

Details

24983 x 100 rating matrix (24983 users and 100 jokes) with 1,810,455 ratings between -10.00 and +10.00. All selected users have rated 15 or more jokes.

The text for the jokes are also available as a character vector of length 100 in JesterJokes.

References

Ken Goldberg, Theresa Roeder, Dhruv Gupta, and Chris Perkins. "Eigentaste: A Constant Time Collaborative Filtering Algorithm." Information Retrieval, 4(2), 133-151. July 2001.

Examples

data(Jester)
Jester

hist(getRatings(Jester), main = "Distribution of ratings")

# what is the best joke? (highest average rating)
best <- which.max(colMeans(Jester))
cat(JesterJokes[best])