Page 26 - Proceeding The 2nd International Seminar of Science and Technology : Accelerating Sustainable Innovation Towards Society 5.0
P. 26

nd
               The 2  International Seminar of Science and Technology
               “Accelerating Sustainable innovation towards Society 5.0”
               ISST 2022 FST UT 2022
               Universitas Terbuka
               the initial centroid cluster candidate in random [21]. Here are the steps
               to use the K-Means method:
               a.  Specify k as the number of clusters you want to form.
               b.  Comparing random values for the initial center cluster (centroid) by
                  k.
               c.  Calculates the distance of each input data to each centroid using
                  the distance formula (Euclidean Distance) until the closest distance
                  of each data is found with the centroid (Equation 1).
                   D(xi,µj) = √∑(xi − μj)                            (1)
                                      2
               d.  Classify  each  data  based  on  its  proximity  to  the  centroid  (the
                  smallest distance).
                  Renews  the  centroid  value.  The  new  centroid  value  is  obtained
                  from the average of the cluster concerned using Equation 2, where
                  μj(t+1: new centroid on iteration to (t+1), and Nsj: lots of data on
                  the Sj cluster.
                   µj(t+1) =   1   ∑  xj                             (2)
                           Nsj  j∈sj
               e.  Loop from steps 2 to 5, until none of the members of each cluster
                  have changed.

               3   RESULT
               3.1  Descriptive Statistics
               Descriptive  statistics  is  an  activity  in  data  collection,  structuring,
               summarizing, and presenting data in the hope that the data is more
               meaningful and easier to understand. In this section, data exploration
               will be carried out which aims to describe the specifics of the data. The
               data used is data on the spread of dengue disease sufferers using the
               data  attributes  of  the  number  of  dengue  sufferers  in  2020  and
               geographical factor  attributes in the form of population density, the
               number of health facilities per district in East Lombok Regency. Here
               will be shown some graphs that depict the characteristics of the data.








               ISST 2022 – FST Universitas Terbuka, Indonesia              5
               International Seminar of Science and Technology “Accelerating Sustainable
               Towards Society 5.0
   21   22   23   24   25   26   27   28   29   30   31