Page 26 - Proceeding The 2nd International Seminar of Science and Technology : Accelerating Sustainable Innovation Towards Society 5.0
P. 26
nd
The 2 International Seminar of Science and Technology
“Accelerating Sustainable innovation towards Society 5.0”
ISST 2022 FST UT 2022
Universitas Terbuka
the initial centroid cluster candidate in random [21]. Here are the steps
to use the K-Means method:
a. Specify k as the number of clusters you want to form.
b. Comparing random values for the initial center cluster (centroid) by
k.
c. Calculates the distance of each input data to each centroid using
the distance formula (Euclidean Distance) until the closest distance
of each data is found with the centroid (Equation 1).
D(xi,µj) = √∑(xi − μj) (1)
2
d. Classify each data based on its proximity to the centroid (the
smallest distance).
Renews the centroid value. The new centroid value is obtained
from the average of the cluster concerned using Equation 2, where
μj(t+1: new centroid on iteration to (t+1), and Nsj: lots of data on
the Sj cluster.
µj(t+1) = 1 ∑ xj (2)
Nsj j∈sj
e. Loop from steps 2 to 5, until none of the members of each cluster
have changed.
3 RESULT
3.1 Descriptive Statistics
Descriptive statistics is an activity in data collection, structuring,
summarizing, and presenting data in the hope that the data is more
meaningful and easier to understand. In this section, data exploration
will be carried out which aims to describe the specifics of the data. The
data used is data on the spread of dengue disease sufferers using the
data attributes of the number of dengue sufferers in 2020 and
geographical factor attributes in the form of population density, the
number of health facilities per district in East Lombok Regency. Here
will be shown some graphs that depict the characteristics of the data.
ISST 2022 – FST Universitas Terbuka, Indonesia 5
International Seminar of Science and Technology “Accelerating Sustainable
Towards Society 5.0