Skip to the content.

Datasets

Overview

Planwise combines multiple data sources to create a comprehensive recommendation system. Our data foundation includes user-place interactions, place metadata, and user reviews.

Core Data Sources

User-Place Ratings

File: rating-California.csv

This dataset contains user ratings of places on a scale of 1-5 stars:

Active User List

File: filtered_users.csv

To ensure quality recommendations, we focus on users with significant engagement:

Place Metadata

File: meta-California.json

Rich metadata about each venue:

Madrid Places

File: combined_places.csv

Comprehensive dataset of Madrid venues:

Review Sample

File: review-California_10.json

Sample of user reviews:

Derived Datasets

Through our preprocessing pipeline, we generate several intermediate datasets:

Merged User-Place-Category Dataset

File: users_ratings_categories.csv

User-Category Aggregation

Files:

These files provide aggregated metrics of how users rate different categories:

Active-Category Filtered Dataset

File: filtered_users_over_20_categories.csv

Normalized Category Dataset

File: final_users_over_20_categories.csv

Data Quality

All datasets undergo thorough validation: