Start

14/10/2025

Close

Housing Prices Challenge

Predict sale price of a house based on various features

Challenge Rewards:

knowledge

Participants

162

Submissions

157

Housing Prices Data

Summary

Introduction

This dataset is based on Gigasheet(2023) that contains detailed real estate listings, including property type, price, location, bedrooms, bathrooms, area (in Marla), and purpose (rent/sale). Designed for housing market analysis, it supports trend assessment, comparative pricing, and data-driven investment decisions. It is also used for the Housing Price Challenge, enabling predictive modeling and valuation insights.

Dataset Structure

Download

Download Housing Prices Data.

Include

The dataset consists of two files:

  • train.csv:
    • Used for training machine learning models.
    • Contains 69,649 entries with detailed property features (e.g., type, price, location) and corresponding sale prices.
  • test.csv:
    • Contains 29,850 entries for evaluation.
    • Note:
      • The test data (test.csv) is used to predict labels for generating the submission file.
      • Submissions are evaluated and ranked on the leaderboard

An example from train.csv:

house_index,property_type,price,location,city,baths,purpose,bedrooms,Area_in_Marla
0,Flat,10000000,G-10,Islamabad,2,For Sale,2,4.0

An example from test.csv:

house_index,property_type,location,city,baths,purpose,bedrooms,Area_in_Marla
110571,House,Bahria Town Karachi,Karachi,3,For Sale,3,8.0

Data Fields

The train.csv has the following fields:

  • House_index: Unique identifier for each property.
  • Property_type: Type of the property (e.g., House, Flat,...).
  • Location: Specific location or neighborhood of the property.
  • City: City in which the property is located.
  • Baths: Number of bathrooms.
  • Bedrooms: Number of bedrooms.
  • Area_in_marla: Total area of the property in marla.
  • Purpose: Indicates whether the property is for sale or rent.
  • Price: Target variable representing the property price.

Reference

For more information about the dataset, please visit the gigasheet website.

License

This repository is licensed under the MIT License.