1 Exploratory data analysis

This website intends to present the work analysis for the “Análise Exploratória de Dados” class.

The ideia is Practicing R using a real anonymized Czech bank transactions, account info, and loan records released for PKDD’99 Discovery Challenge.

Use the menu above to navigate and see the final report.

1.1 Professor

1.2 Authors / students

1.3 Where to find the source code of this project?

This project can be found and downloaded on GitHub: https://github.com/ldaniel/R_Bank_Berka

Valar Morghulis! :)

2 About the data

Data from a real Czech bank. From 1999.

The data about the clients and their accounts consist of following relations:

Each account has both static characteristics (e.g. date of creation, address of the branch) given in relation “account” and dynamic characteristics (e.g. payments debited or credited, balances) given in relations “permanent order” and “transaction”. Relation “client” describes characteristics of persons who can manipulate with the accounts. One client can have more accounts, more clients can manipulate with single account; clients and accounts are related together in relation “disposition”. Relations “loan” and “credit card” describe some services which the bank offers to its clients; more credit cards can be issued to an account, at most one loan can be granted for an account. Relation “demographic data” gives some publicly available information about the districts (e.g. the unemployment rate); additional information about the clients can be deduced from this.

Source: This database was prepared by Petr Berka and Marta Sochorova.