Siirry suoraan sisältöön

Data Preprocessing (4 cr)

Code: TTC8030-3011

General information


Enrollment

18.11.2024 - 09.01.2025

Timing

20.01.2025 - 16.02.2025

Number of ECTS credits allocated

4 op

Virtual portion

4 op

Mode of delivery

Online learning

Unit

School of Technology

Teaching languages

  • English

Seats

0 - 35

Degree programmes

  • Bachelor's Degree Programme in Information and Communications Technology
  • Bachelor's Degree Programme in Information and Communications Technology

Teachers

  • Antti Häkkinen

Groups

  • TTV22S5
    Tieto- ja viestintätekniikka (AMK)
  • TTV22S2
    Tieto- ja viestintätekniikka (AMK)
  • TTV22S3
    Tieto- ja viestintätekniikka (AMK)
  • TIC22S1
    Bachelor's Degree Programme in Information and Communications Technology
  • TTV22S1
    Tieto- ja viestintätekniikka (AMK)
  • TTV22SM
    Tieto- ja viestintätekniikka (AMK)
  • TTV22S4
    Tieto- ja viestintätekniikka (AMK)
  • TTV22SM2
    Tieto- ja viestintätekniikka (AMK)
  • ZJA25KTIDA1
    Avoin amk, Data-analytiikka 1, Verkko
  • 23.01.2025 16:30 - 17:30, Data Preprocessing [Opening lecture]

Objective

After the course, you will understand the data analytics process and the challenges it brings. You can identify different data formats, the most common interface solutions and the tools and methods used in data preprocessing. In addition, you know how to apply the methods necessary for data preprocessing.

EUR-ACE Competences:
Knowledge and Understanding
Engineering Practice

Content

o Various data sources and data formats, JSON, APIt, retrieval of tables from SQL
o Types of variables
o Data preprocessing before bringing it into analysis program
o Data preprocessing in Pandas (basics of Pandas/DataFrames)
o Connecting various data sources

Location and time

The course is implemented online (no contact teaching). The student can proceed at his own pace during the course.

Oppimateriaali ja suositeltava kirjallisuus

Course website (lecture material, exercises, other instructions).

Teaching methods

The course includes assignments from different subject areas of the course.

Student workload

Environment preparations, exercises and familiarisation with the material 108 h. Total 108 h

Evaluation scale

0-5

Arviointikriteerit, tyydyttävä (1-2)

Satisfactory 2: The student masters data retrieval from a selected source. You are able to implement data preprocessing to datasets. You are able to apply simple methods used in data preprocessing to his/her data. You can assess your own solutions for data preprocessing.

Sufficient 1: You know and understand the significance of data and its advantages. You know the significance of data preprocessing and its most common methods. You are able to apply simple methods obtained and used in data preprocessing.

Arviointikriteerit, hyvä (3-4)

Very good 4: You master the retrieval of data from various sources. You are able to plan and implement data preprocessing for various datasets. You are able to apply the methods used in data preprocessing widely. You can assess and justify your own solutions for data preprocessing.

Good 3: You master data retrieval from several sources. You are able to plan and implement data preprocessign for datasets. You are able to apply methods used in data preprocessing. You can to assess and justify your own solutions for data preprocessing.

Assessment criteria, excellent (5)

Excellent 5: You master the retrieval of data from various sources. You are able to plan and implement data processing for various datasets. You are able to apply the methods used in data preprocessing. You can critically assess and justify your own solutions for data preprocessing.

Qualifications

Basic skills of computer use, basic skills in programming, knowledge of Python programming language.