Exploring factors for melodic diversification of folk songs in the Ryukyu Archipelago

Yuri Nishikawa; Yasuo Ihara

doi:10.1017/ehs.2025.10010

Exploring factors for melodic diversification of folk songs in the Ryukyu Archipelago

Published online by Cambridge University Press: 22 July 2025

Yuri Nishikawa

and

Yasuo Ihara

Show author details

Yuri Nishikawa: Affiliation:
Department of Biological Sciences, The University of Tokyo, Bunkyo-ku, Tokyo, Japan Department of Molecular Life Science, Tokai University School of Medicine, Isehara-shi, Kanagawa, Japan
Yasuo Ihara*: Affiliation:
Department of Biological Sciences, The University of Tokyo, Bunkyo-ku, Tokyo, Japan
*: Corresponding author: Yasuo Ihara; Email: iharay@bs.s.u-tokyo.ac.jp

Article contents

Abstract
Social media summary
Introduction
Materials and methods
Results
Discussion
Supplementary material
Author contributions
Financial support
Conflicts of interest
Research transparency and reproducibility
References

Abstract

Cultural evolution of traditional music around the world has been the subject of recent quantitative investigations. Researchers have explored cultural diffusion of music as well as patterns of geographic variation that may result. By comparison, less has been studied about the process of music diversification; in particular, under what circumstances music diversifies is yet to be understood. In this study, we examine possible factors that may facilitate music diversification, using data from folk songs in the Ryukyu Archipelago, south-western islands of Japan. For a quantitative analysis, we first transform the melody of each folk song, following an automated scheme, into a sequence of alphabets, which is then used to quantify the melodic dissimilarity between each pair of songs. Our particular interest is in the dissimilarity between putative sister songs, or songs that are inferred to have derived from a common origin, and factors that have positive or negative effects on it. Our results suggest that sister songs tend to diversify more when they are sung in different islands, probably as a result of one being transmitted from one island to another, and when they have come to be sung in different social contexts.

Keywords

cultural evolution evolution of music folk songs melodic variation Ryukyu Archipelago

Information

Type: Research Article
Information: Evolutionary Human Sciences , Volume 7 , 2025 , e23

DOI: https://doi.org/10.1017/ehs.2025.10010 [Opens in a new window]
Creative Commons: This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright: © The Author(s), 2025. Published by Cambridge University Press.

Social media summary

Folk songs in the Ryukyu Archipelago, Japan, diversify more when sung in different islands or social contexts

1. Introduction

A growing number of studies have investigated the diversity of traditional music around the world using quantitative methods (Mehr et al., Reference Mehr, Singh, Knox, Ketter, Pickens-Jones, Atwood, Lucas, Jacoby, Egner, Hopkins, Howard, Hartshorne, Jennings, Simson, Bainbridge, Pinker, O’Donnell, Krasnow and Glowacki2019; Passmore et al., Reference Passmore, Wood, Barbieri, Shilton, Daikoku, Atkinson and Savage2024; Savage, Reference Savage2019; Wood et al., Reference Wood, Kirby, Ember, Silbert, Passmore, Daikoku, McBride, Paulay, Flory, Szinger, D’Arcangelo, Bradley, Guarino, Atayeva, Rifkin, Baron, El Hajli, Szinger and Savage2022). Although these studies have illuminated patterns of music diversity, less attention has been paid to under what circumstances the diversification is likely to occur. For example, Nishikawa and Ihara (Reference Nishikawa and Ihara2022) analysed the geographic variation of folk songs in the Ryukyu Archipelago of Japan using a cultural evolutionary approach, and suggested that horizontal transmission between regions played a major role in shaping the observed patterns of folk song diversity; however, the sort of factors that facilitated the diversification process remain unexplored.

In this study, we aim to examine possible factors that may have promoted diversification of folk songs. For that purpose, we focus on groups of folk songs in the Ryukyu Archipelago that are inferred to have derived from a single origin and evaluate the effects of possible factors on the diversity within those groups. In each region of the Ryukyu Archipelago, folk songs developed under the influence of Ryukyuan classical music of the noble class of the Ryukyu Kingdom from the fifteenth century onwards and respective local traditions (Yamanouchi, Reference Yamanouchi1959; Uchida, Reference Uchida1983; Nippon Hoso Kyokai [NHK], 1989–1993). NHK, the Japanese public broadcasting company, has collected recordings of these folk songs from around the archipelago. Traditionally, songs have been transmitted orally, but recently they may also be learnt by listening to recordings. When there is accompaniment to songs, the most commonly used instrument throughout the archipelago is the sanshin, a stringed instrument. Accompaniment by the sanshin is sometimes transcribed by adapting musical notation used in Ryukyuan classical music, called kunkunshi (NHK, 1989–1993).

Within the Ryukyu Archipelago, there are cases in which a pair of folk songs exhibit a close similarity despite being sung in distant islands, and other cases in which songs are quite dissimilar to each other despite having the same title (NHK, 1989–1993). Ethnomusicologists have attempted to disentangle the complex history of the folk songs in the Ryukyu Archipelago, and proposed a shared origin for some groups of songs based on a general judgement of similarities in melodies, lyrics, and titles (NHK, 1989–1993). One objective of the present study is to examine whether these qualitative identifications of sister songs are supported by a simplified quantitative method based strictly on melodic similarity. Another is to infer the roles of geography, time, and social context in the process of song diversification by evaluating the effects of these factors on the diversity within closely related songs.

Regarding the effect of geography, cultural transmission of songs within a narrow geographic area may occur through repeated opportunities of singing and hearing, and if so, the transmission is expected to be relatively accurate. By comparison, transmission of songs over a long distance may result from a single or a few instances of singing/hearing, when the singers or hearers travel somewhere distant from their places of residence, in which case, the transmission is expected to be more error-prone. In particular, the isolation between islands by the sea is expected to have a large effect on song diversification, for the same reason that the islands in the Ryukyu Archipelago exhibit rich biodiversity and endemism (Chiang & Schaal, Reference Chiang and Schaal2006; Motokawa, Reference Motokawa2000; Ota, Reference Ota1998). As for time, we investigate possible effects of temporal changes in melodies in a given place (or to put it differently, the amount of mutation in the vertical transmission of melodies) by taking into consideration the difference of the recording years of songs. Finally, we focus on the difference in social contexts in which the folk songs are sung. Nishikawa and Ihara (Reference Nishikawa and Ihara2022) demonstrated the importance of social context in song transmission by showing that songs sung in the ‘work’ context exhibited larger divergence between regions than those sung in other contexts, and that the variation in the work songs are associated with the linguistic variation within the Ryukyu Archipelago. Therefore, we expect that sister songs diversify more when they come to be sung in different social contexts.

Our focus on melody is motivated by the results of a previous study. Nishikawa and Ihara (Reference Nishikawa and Ihara2022) measured the distances between 1,342 folk songs in the Ryukyu Archipelago using CantoCore (Savage et al., Reference Savage, Merritt, Rzeszutek and Brown2012), a cross-cultural music classification scheme. In their multidimensional scaling (MDS) plots, songs did not cluster according to their geographical locations or the social contexts in which they were sung, but did form two clusters roughly corresponding to ‘a-modal’ songs (which do not include pitch classes at a minor or major third above the tonic) and ‘major iso-modal’ songs (which have major third notes but lack minor third notes). This result suggests the significance of musical scales in the diversity of folk songs in the Ryukyu Archipelago. Therefore, in this study, we concentrate on the variation in melodies as well as the scales on which they are based.

Koizumi (Reference Koizumi1958) advocated four types of basic scales that constitute traditional Japanese songs (Figure 1), and most of the subsequent studies are based on this theory. The Ryukyu scale is found in the Ryukyu Archipelago (especially from Okinoerabu island in the Amami islands in the north to the Yaeyama islands in the south) and Indonesia. The ritsu scale is broadly found in East Asia including the Ryukyu Archipelago. The minyo scale is very common in mainland Japan but infrequent in the Ryukyu Archipelago, except in the northern part of the Amami islands, where it is relatively popular. The miyako-bushi scale is also found in mainland Japan and rare in the Ryukyu Archipelago, but it is found in the northern part of the Amami islands (Koizumi, Reference Koizumi1958; NHK, 1989–1993).

Figure 1. Four types of scales that constitute traditional Japanese songs advocated by Koizumi (Reference Koizumi1958). (a) The Ryukyu scale. (b) The ritsu scale. (c) The minyo scale. (d) The miyako-bushi scale. Notes indicated in white are considered to be important as ‘nuclear tones’. Based on the figures by Koizumi (Reference Koizumi1958) and NHK (1989–1993).

Melodies are suitable for microevolutionary approaches, as they can be represented as sequences of notes analogous to DNA or protein sequences (Savage, Reference Savage2019). Recently, based on such ideas, studies have been conducted to quantify the similarity between melodies of various kinds of music (Bountouridis et al., Reference Bountouridis, Brown, Wiering and Veltkamp2017; Hillewaere et al., Reference Hillewaere, Manderick, Conklin, Spiliopoulou, Schmidt-Thieme and Janning2014; Janssen et al., Reference Janssen, van Kranenburg and Volk2017; Mongeau & Sankoff, Reference Mongeau and Sankoff1990; Mora et al., Reference Mora, Gómez, Gómez and Díaz-Báñez2016; Savage et al., Reference Savage, Passmore, Chiba, Currie, Suzuki and Atkinson2022; van Kranenburg et al., Reference van Kranenburg, Volk and Wiering2013). Savage and Atkinson (Reference Savage, Atkinson, Müller and Wiering2015) developed a method for quantifying the similarity between melodies by coding and aligning them as sequences of the 12 pitch classes. They proposed combinations of parameter values for their sequence alignment algorithms that perform best when separating songs into different tune families identified by expert musicologists, and when measuring similarities between songs within tune families. Note that a tune family is defined by Bayard (Reference Bayard1950) as ‘a group of melodies showing basic interrelation by means of constant melodic correspondence, and presumably owing their mutual likeness to descent from a single air that has assumed multiple forms through processes of variation, imitation, and assimilation’. In what follows, we quantify the difference between melodies of folk songs in the Ryukyu Archipelago using Savage and Atkinson’s (Reference Savage, Atkinson, Müller and Wiering2015) method. Then we examine whether the qualitative classification of sister songs based on a general judgement is supported by the quantitative evaluation of melodic similarity. Finally, we evaluate the effects of various factors on the variation of melodies within presumably related songs to infer under what circumstances music diversification is accelerated.

2. Materials and methods

2.1. Data

We used two sources of data on folk songs in the Ryukyu Archipelago of Japan. First, melodies were sampled from published musical scores in ‘A Survey of Japanese Folksongs – Okinawa-Amami Islands’ (hereafter SJF; NHK, 1989–1993), for which songs were recorded between 1964 and 1990. Songs in SJF were collected by NHK, with an intention to select songs reflecting traditional life and culture of each region and to include various types of songs (NHK, 1989–1993). Second, for the purpose of examining the possible effect of temporal changes in melodies, we used audio recordings of songs sung by Ryukyuan singers, which were collected by one of the authors (Y. Nishikawa) between 2015 and 2019. In SJF, some songs are similar overall to each other and thus are assumed to have the same origin. As for the songs from our original recordings, the singers described some of the songs they sang as being closely related to others. On the basis of these descriptions, one of the authors (Y. Nishikawa) compiled the songs from SJF and our original recordings into ‘song groups’, or groups of songs that are inferred to have derived from a common origin. Of all audio recordings that we collected, those melodies that did not share the same song group with any melodies from SJF were excluded. For our main analysis, we used 38 song groups (Supplementary Table S1) that included at least three songs, whether from SJF or our original recordings, which amounted to 148 songs from four regions in the Ryukyu Archipelago: Amami, Okinawa, Miyako, and Yaeyama (Figure 2). Eighty-eight of these songs were from SJF, and the remaining 60 were from the audio recordings. In addition, we repeated the analysis using only 44 songs from SJF that belonged to 13 song groups including at least three SJF songs. The songs analysed were old songs whose lyricists and composers are unknown. Songs written in the twentieth century by known lyricists and composers, called shin minyo (new folk songs), were excluded. Ryukyuan classical music, in which variants are not allowed, were also excluded. On the other hand, melodies from SJF (O2.2, O11.1, O12.1, O13.1) that had been published in commercially available records were analysed without distinction from other melodies. Melodies from the audio recordings were performed by professional or amateur singers (it is often difficult to clearly distinguish between them) and were analysed without distinction. Our choice of not excluding songs with possible commercial influence is generally in line with Pendlebury’s (Reference Pendlebury2020) view that the distinction between commercial and folk music is not straightforward.

Figure 2. Map of the Ryukyu Archipelago. The locations of the four regions and 17 islands used for the analyses are indicated. Created based on a map from Geospatial Information Authority of Japan (https://maps.Gsi.Go.Jp/vector/).

2.2. Coding

Following Savage and Atkinson (Reference Savage, Atkinson, Müller and Wiering2015), the melody of each song was coded as a sequence of alphabets assigned to the 12 pitch classes as shown in Figure 3a, where note values and octave differences were ignored. Although alternative coding methods can also be applied, previous studies have produced mixed results about whether the inclusion of note durations improves alignment performance (Janssen et al., Reference Janssen, van Kranenburg and Volk2017; van Kranenburg et al., Reference van Kranenburg, Volk, Wiering and Veltkamp2009). Therefore, this study used a conventional coding method that can deal with melodies as simply as possible, that is, ignoring note durations (Bountouridis et al., Reference Bountouridis, Brown, Wiering and Veltkamp2017; Savage et al., Reference Savage, Passmore, Chiba, Currie, Suzuki and Atkinson2022) and octaves (Bountouridis et al., Reference Bountouridis, Brown, Wiering and Veltkamp2017; Mongeau & Sankoff, Reference Mongeau and Sankoff1990; Savage et al., Reference Savage, Passmore, Chiba, Currie, Suzuki and Atkinson2022). All melodies were transposed so that the tonic was C, and repeated parts and responsorial parts were omitted. The audio recordings were coded with the help of WaveTone (https://ackiesound.ifdef.jp/), a free software program for audio analysis. Folk songs in Japan including the Ryukyu Archipelago are not created based on 12-tone equal temperament, but are based on the four types of five-tone scales (Figure 1) and conventionally represented approximately in staff notation (Koizumi, Reference Koizumi1958). As this study does not analyse pitch difference, there are no major problems in assigning alphabets of the 12 pitch classes.

Figure 3. (a) Letters assigned to the 12 pitch classes. (b) Example of alignment of a pair of melodies using parameter set 2.

2.3. Alignment and similarity between melodies

To infer shared origin between melodies, we aligned the coded sequences following Savage and Atkinson (Reference Savage, Atkinson, Müller and Wiering2015). In particular, we used two sets of parameter values proposed by Savage and Atkinson (Reference Savage, Atkinson, Müller and Wiering2015) for the gap opening penalty (GOP), the gap extension penalty (GEP), and whether difference in mode is included or ignored (i.e. whether lowercase letters are recoded as uppercase letters). The GOP and GEP are to penalize the creation and extension of a gap in a sequence, where the smaller the penalties are, the more likely gaps are to be inserted in the alignment. Small penalties are adequate for closely related sequences, and large penalties are adequate for more divergent sequences (Thompson et al., Reference Thompson, Higgins and Gibson1994). Parameter set 1 (GOP = 12, GEP = 6, ignoring mode) is suggested to be suitable for separating between ‘tune families’, that is, groups of songs that are known to be closely related to each other. Parameter set 2 (GOP = 0.8, GEP = 0.2, including mode) is considered suitable for sequence alignment within a tune family (Figure 3). The similarity between two melodies was calculated as percent identity (PID) according to the following formula:

\begin{equation*}{\text{PID }} = 100\left( {\frac{{ID}}{{\frac{{L1 + L2}}{2}}}} \right),\end{equation*}

where ID represents the number of identical notes after alignment, and L1 and L2 are the lengths of sequences prior to alignment. These analyses were conducted in R version 4.1.1 applying code available at https://github.com/pesavage/melodic-evolution.

2.4. Neighbor-Net

To examine whether the song groups identified by a general judgement can be reproduced by a phylogenetic analysis based on the automated quantification of melodic similarities, we performed a Neighbor-Net analysis with the distance between melodies being calculated as 1 − PID/100. A network was obtained for each of the four regions (Figure 2) and each parameter set, using SplitsTree4 (Huson & Bryant, Reference Huson and Bryant2006).

2.5. Linear mixed model

A linear mixed model (LMM) analysis was performed to examine the factors affecting the melodic changes using lme4 and lmerTest packages in R version 4.1.1. The dependent variable was the distance between 320 pairs of melodies belonging to the same song group, calculated as 1 − PID/100. We first constructed the following full model that included all the independent variables that we considered to be possible factors producing the distance between melodies:

(1)

\begin{equation}{Y_{ij}} = {{{\beta }}_0} + {{{\beta }}_1}{I_{ij}} + {{{\beta }}_2}{D_{ij}} + {{{\beta }}_3}{T_{ij}} + {{{\beta }}_4}{C_{ij}} + {{{\beta }}_5}{S_{ij}} + {r_{0k}}\end{equation}

where Y_ij is the difference between melodies i and j belonging to song group k.

The independent variables are as follows. First, I_ij indicates whether melodies i and j were recorded in different islands (0 if the same island, 1 if different islands). Second, D_ij denotes the geographic distance between the sites where melodies i and j were recorded. For the geographic distance, the latitudes and longitudes of the recording sites were obtained from the documented names of the places using the CSV Address Matching Service (https://geocode.csis.u-tokyo.ac.jp/) provided by the Center for Spatial Information Science at the University of Tokyo, except for two cases (song ID A9.1 and M3.4 in Supplementary Table S1), in which the longitudes and latitudes were obtained using the map from the Geographical Survey Institute of Japan (https://maps.gsi.go.jp/). The geographic distances (km) between locations were calculated from the longitudes and latitudes using geosphere package in R version 4.1.1. As some melodies were described as being sung throughout the Okinawa islands (song ID O2.2, O11.1, O12.1, O13.1; NHK, 1989–1993) and there is no information on where they were recorded, I_ij and D_ij between these and other melodies were set to NA. These variables are included to capture the geographic effects on the amount of mutation in the horizontal transmission of melodies. In other words, as I_ij changes from 0 to 1 or D_ij increases, Y_ij is expected to increase as well.

Third, T_ij represents the difference of the recording years between melodies i and j. This variable is intended to measure the amount of mutation in the vertical transmission of melodies. In other words, as T_ij increases, Y_ij is expected to increase as well. Fourth, C_ij indicates whether melodies i and j were sung in different social contexts or not (0 if the same context, 1 if different contexts). SJF classified songs into ‘child’, ‘ritual’, ‘work’, and ‘amusement’ songs according to the social context in which they were sung in village communities of the Ryukyu Archipelago based on a scheme partially derived from Yanagita (Reference Yanagita1940). The social context ‘child’ includes lullabies and songs sung by children. Melodies of ‘ritual’ analysed in this study are sung for festivals, rain-making, and so on. Melodies of ‘work’ analysed in this study are sung in farming, shipbuilding, and so on. Melodies of ‘amusement’ are sung for the sake of singing. Similarly, we determined the social contexts of our originally recorded songs based on descriptions of the songs by the singers. We examine the hypothesis that sister melodies are more dissimilar when they come to be sung in different social contexts. In other words, as C_ij changes from 0 to 1, Y_ij is expected to increase as well. Fifth, in order to control for any possible difference between melodies from SJF and those from our original recordings, we included S_ij, which indicates whether the sources of melodies i and j are different (0 if the same source, 1 if different sources). These independent variables are listed in Table 1. Finally, to control for possible difference between song groups in the rate of melodic change, a random effect of song groups on the intercept (r _0k) was included. Song groups may vary in, for example, the number of notes with strong rhythmic functions, which has been suggested to be negatively associated with the rate of change (Savage et al., Reference Savage, Passmore, Chiba, Currie, Suzuki and Atkinson2022).

Table 1. List of the independent variables of LMM

Model selection was performed using the step function of lmerTest package. First, starting from the full model (Eq. (1)), backward elimination of the random effects was performed using a likelihood ratio test with a significance level of 0.1. This was followed by backwards elimination of the fixed effects using an F-test based on Satterthwaite’s approximation with a significance level of 0.05. Backwards elimination was chosen to avoid overlooking important variables by accepting too-simplistic models (Kuznetsova et al., Reference Kuznetsova, Brockhoff and Christensen2017, Reference Kuznetsova, Christensen, Bavay and Brockhoff2015).

3. Results

3.1. Similarity between melodies

PID between melodies of the same song group tended to be high. When parameter set 1 was used, the median PID between melodies from the same song groups was 51.75 (interquartile range [IQR] 42.07–62.77) and the median PID between melodies from different song groups was 32.63 (IQR 26.87–38.66) (Figure 4a). When parameter set 2 was used, the median PID was 63.21 (IQR 53.27–72.29) for the same song groups and 43.18 (IQR 36.92–49.56) for different song groups (Figure 4b). When limited to melodies from SJF and using parameter set 1, the median PID was 47.86 (IQR 41.48–56.14) for the same song groups and 34.09 (IQR 28.57–39.53) for different song groups (Figure 4c). When parameter set 2 was used, the median PID was 58.58 (IQR 53.73–66.67) for the same song groups and 44.71 (IQR 38.96–50.51) for different song groups (Figure 4d). In all four cases, the Brunner–Munzel test, which is a nonparametric rank test for two distributions without assuming equal variances (Brunner & Munzel, Reference Brunner and Munzel2000), confirmed that the distributions of the PID values are different between the same-group and different-group comparisons (p < 0.01). These results suggest that quantitative evaluation of song relationships based solely on melodic similarities largely supports the holistic and potentially more subjective classification of song groups.

Figure 4. Distributions of PID between melodies from different song groups and the same song groups. PID values were calculated using (a) all melodies and parameter set 1, (b) all melodies and parameter set 2, (c) only melodies from SJK and parameter set 1, and (d) only melodies from SJK and parameter set 2. The boxes represent the first, second and third quartiles, and lengths of the whiskers represent 1.5 × IQR.

3.2. Neighbor-Net

Neighbor-Net graphs of melodies are shown in Figure 5 and Supplementary Figures S1–4. Delta score (δ) is a measure of deviation from tree structure of phylogenetic data, which equals zero if the data are perfectly consistent with a tree structure and otherwise ranges between 0 and 1. When the delta score is large, the relationship of the taxa is more appropriately represented by a network rather than a tree (Holland et al., Reference Holland, Huber, Dress and Moulton2002). As this study does not assume a single ancestor and phylogenetic relationship between all the melodies analysed, it is reasonable that δ showed moderately large values. Song IDs shown in these figures specify songs as well as the song groups to which they belong; for example, song ID ‘A1.1’ represents the first song in song group A1 (Supplementary Table S1). In the networks of the entire Ryukyu Archipelago, melodies of the same song groups basically formed clusters, except for some melodies of the song groups A2-8, O1, M2, Y2, and Y5. Of these exceptional song groups, the social contexts of the melodies in A2-6 and O2 were varied within a group. This finding further supports the notion that the quantitative evaluation of melodic similarity in songs is congruent with the qualitative classification of song groups (Figure 5). Clusters were formed by the songs based on the Ryukyu and ritsu scales rather than the songs from different regions for both parameter sets 1 and 2. This may be because melodies belonging to the same song group are likely to retain the same mode and the combinations of alphabets used in melodies are the same for the same scale. Although melodies based on the minyo scale (song ID A1.1-3, A2.2-4, A6.2, A6.3, A10.1-3), all of which were from Amami, were included in the cluster of the Ryukyu scale for parameter set 1 (Figure 5a), this is as expected because parameter set 1 ignores difference in mode, and as a consequence, does not distinguish the Ryukyu and minyo scales. Indeed, when parameter set 2 was used, the melodies based on the minyo scale formed a single cluster adjacent to the cluster of the ritsu scale (Figure 5b). In addition, some melodies based on the Ryukyu scales of Okinawa (song ID O1.3, O4.1-3, O8.1-3, O9.1-3, O11.1-3, O12.1-3) were separated from the Ryukyu cluster for both parameter sets. This may be consistent with the claim that song groups O4, O8, O9, and O12 (see Supplementary Table S1) were originally in the ritsu scale and have transformed to be in the Ryukyu scale (NHK, 1989–1993).

Figure 5. Neighbor-Net graphs based on the distances between melodies of the entire Ryukyu Archipelago with (a) parameter set 1 (δ = 0.407) and (b) parameter set 2 (δ = 0.3575). Colours indicate the regions corresponding to Figure 2.

There are some interesting findings from networks of melodies drawn separately for each region. In the networks of Amami, melodies sung in the social context of ritual (song ID A2.1, A2.2, A3.1, A4.1, A4.2, A4.4, A5.1, A5.2, A6.1, A7.1, A7.2, and A7.3) did not show a clear cluster but were roughly cohesive (Supplementary Figure S1). In the networks of Okinawa, songs belonging to a group called ‘Myakuni’ (song group O10) formed a cluster (Supplementary Figure S2a). These are lyrical songs sung in various areas of the Okinawa islands, without fixed lyrics and sometimes improvised (NHK, 1989–1993). Despite the large variety of lyrics, it was confirmed that similar melodies had been maintained across broad areas. It is also said that the melodies of ‘Myakuni’ are similar to those of ‘Tarama-shunkani’ (song group M5) sung in Miyako (NHK, 1989–1993), and indeed they are close to each other in the network of the entire Ryukyu Archipelago with parameter set 1. ‘Tarama-shunkani’, which clustered together in the networks of Miyako (Supplementary Figure S3), are lyrical songs about farewell to an official leaving an island by his local wife. In the networks of Yaeyama, rain-making songs (song group Y2) were divided into two clusters for both parameter sets except for song ID Y2.16, but the clusters split differently depending on the parameter sets (Supplementary Figure S4). Based on previous ethnomusicological studies, rain-making songs in Yaeyama are considered to be further classified into two types (NHK, 1989–1993). One type has diverse melodies and beats, and the tempo is generally slow, whereas the other has less diverse melodies and is in clear duple time, with the tempo being faster. The clusters observed in the Neighbor-Net graphs did not perfectly match this classification, possibly because beat and tempo were not included in the analysis; however, these may suggest that the two types of melodies have not evolved from two clearly distinct ancestral melodies.

3.3. Linear mixed model

Residual and QQ plots for the full model (Eq. (1)) are shown in Supplementary Figure S5. Because the assumptions of linearity and normality are largely satisfied, it is appropriate to use the LMM. As the following results were almost the same between using parameter set 1 and 2, only the results using parameter set 2 are shown in the main article, and the results using parameter set 1 are shown in the Supplement. Model selection was performed using standardized independent and dependent variables, as a result of which the following model was obtained for both parameter sets 1 and 2 (Supplementary Table S2ab):

(2)

\begin{equation}{Y_{ij}} = {{{\beta }}_0} + {{{\beta }}_1}{I_{ij}} + {{{\beta }}_4}{C_{ij}} + {r_{0k}}\end{equation}

In model (2), the variance of the random effect was slightly larger than the variance of the residuals, indicating a stronger random effect of song groups (Supplementary Table S4). The estimated partial regression coefficients for model (2) are shown in Table 2 and Supplementary Table S5. For both parameter sets, the effects of difference in islands and social contexts were statistically significant, with the standard partial regression coefficient for difference in social contexts being larger than that for difference in islands.

Table 2. Results of LMM analysis for all melodies. Model (2) with standardized variables and parameter set 2

Signif. codes: 0

‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘’ 1.

Whereas standardized variables were used to compare the effect sizes across independent variables, non-standardized variables were used to quantify the amount of dissimilarity of melodies corresponding to changes in each independent variable (e.g. how much the melody changes when two recording sites are 1 km apart from each other). Model selection based on non-standardized independent and dependent variables resulted in the same model as model (2) for both parameter sets (Supplementary Table S2ef). The estimates of partial regression coefficients are shown in Table 3 and Supplementary Table S6. The estimates of the effect of difference in islands were 0.044 for parameter set 1 and 0.034 for parameter set 2, representing the extra amount of dissimilarity between a pair of melodies from the same song group when they had been recorded in different islands as compared with when they were from the same island. The estimates of the effects of difference in social contexts were 0.164 for parameter set 1 and 0.145 for parameter set 2, showing the extra distance between a pair of melodies from the same social group when they are sung in different social contexts as compared with when sung in the same social context. Additional analyses using several models not selected in the model selection were also conducted (Supplementary Table S2cdgh). In any cases, the effect of geographic distance and difference in recording years were not significant (Supplementary Tables S8–10).

Table 3. Results of LMM analysis for all melodies. Model (2) with non-standardized variables and parameter set 2

Signif. codes: 0

‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘’ 1.

Although the model selection did not indicate a significant effect of the difference in sources of melodies (S), we conducted additional analyses using only melodies from SJF (126 pairs of melodies from the same song group), considering the potential difference in data characteristics, such as coding criteria, between SJF and our original recordings. Model selection based on standardized variables obtained the following model for both parameter sets (Supplementary Table S3ab):

(3)

\begin{equation}{Y_{ij}} = {{{\beta }}_0} + {{{\beta }}_1}{I_{ij}} + {{{\beta }}_2}{D_{ij}} + {r_{0k}}\end{equation}

Other independent variables than I and D were eliminated by the model selection. The estimates of partial regression coefficients are shown in Table 4 and Supplementary Table S7. For both parameter sets, the effect of difference in islands was significantly positive, and the effect of geographic distance was significantly negative. The negative effect of geographic distance is contrary to our hypothesis, which posits that because cultural transmission of melodies within a narrow geographic area may occur repeatedly, melodies derived from the same origin tend to remain similar. The result is difficult to interpret and might be an artefact attributable to the small sample size and a positive correlation between I and D (r = 0.584, p < 0.01), even though multicollinearity was not suggested (Supplementary Table S13). When model selection was made under the restriction that at most one of I and D is entered into the model, a model without any fixed effect was selected for both parameter sets (Supplementary Table S3c–f). In contrast to model (2), the effect of difference in social context was not included in model (3). According to a post-hoc power analysis for the full model using the powerSim function in the simr package, the statistical power for difference in social contexts was high (99.90%) for both parameter sets when all melodies were used, but lower (19.70% for parameter set 1 and 13.40% for parameter set 2) when limited to melodies from SJF, probably due to the small sample size.

Table 4. Results of LMM analysis for melodies from SJF. Model (3) with standardized variables and parameter set 2

Signif. codes: 0

‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘’ 1.

4. Discussion

In this study, we attempted to illuminate factors affecting cultural evolution of folk songs in the Ryukyu Archipelago by measuring melodic differences between songs, and evaluating the effects of variables that may have promoted or suppressed song diversification. To this end, we first classified songs into song groups, based on various sources of information, as aggregates of songs that have presumably originated from a common origin. We then quantified melodic differences between songs using Savage and Atkinson’s (Reference Savage, Atkinson, Müller and Wiering2015) method, and examined whether a conventional phylogenetic approach supports our assumption of the shared origin of songs within a song group. Finally, we investigated effects of several variables on melodic differences between songs in the same song group to infer the factors affecting song diversification. The results of the analysis suggested the following: first, our classification of song groups is largely consistent with Neighbor-Net analysis (Huson & Bryant, Reference Huson and Bryant2006); second, a pair of songs tend to be more dissimilar when they are sung in different islands within the Ryukyu Archipelago than when sung in the same island, and when they are sung in different social contexts than when sung in the same social context; and third, the effects of geographic distance, recording year, or data source on song diversification are small or negligible.

In contrast to Nishikawa and Ihara (Reference Nishikawa and Ihara2022), who quantified distances between folk songs in the Ryukyu Archipelago on the basis of 26 CantoCore variables (Savage et al., Reference Savage, Merritt, Rzeszutek and Brown2012), this study focused strictly on the melodic variation (Savage & Atkinson, Reference Savage, Atkinson, Müller and Wiering2015), and observed clusters of songs in the Ryukyu scale, those in the ritsu scale, and those in the minyo scale in Neighbor-Net graphs. In addition, some songs in the Ryukyu scale of Okinawa formed a separate cluster, which may reflect the history that these songs were originally in the ritsu scale and were later modified to be in the Ryukyu scale. A limitation common to the present study and that of Nishikawa and Ihara (Reference Nishikawa and Ihara2022) is that their analyses did not take lyrics, an obviously important aspect of folk songs, into consideration. In fact, some songs of ‘Myakuni’, for example, were judged as close to each other despite their stark difference in lyrics. In other cases, songs with almost the same lyrics were judged as distant from each other for their melodic dissimilarity. These observations suggest that melodies and lyrics may be transmitted through different pathways. Folk songs in the Ryukyu Archipelago are sung in various local dialects of the Ryukyuan languages, which share a common ancestor with Japanese and diverged before the seventh century (Pellard, Reference Pellard, Heinrich, Miyara and Shimoji2015). The Ryukyuan languages are classified into five groups – Amami, Okinawa, Miyako, Yaeyama, and Yonaguni – each corresponding to a region, and we focused on the four regions other than Yonaguni in this study. Lyrics of the folk songs are basically in the local dialects, although there are some exceptional cases in which songs are sung in different dialects (NHK, 1989–1993). It would be meaningful to examine whether the lyrics and melody of a song is always transmitted together, and whether the lyrics are more vulnerable to change than the melody is, particularly when the song diffuses into different regions.

The linear mixed model analysis suggested that a pair of melodies within the same song group tend to be more dissimilar when they are recorded in different islands than when recorded in the same islands, and when they are sung in different social contexts than when sung in the same social context. The reason why two sister songs separated by the sea tend to be more dissimilar may be that travel is more difficult and interactions less frequent between than within islands. Our analysis further suggested that the key variable promoting diversification of two songs is whether one of them has crossed the ocean, but not the geographic distance between them, suggesting that the sea is a cultural barrier that is stronger than expected from mere distance. As for the effect of social context, our results suggest that sister songs tend to be more dissimilar when one of them comes to be sung in a social context that is different from the one in which they were originally sung. This is consistent with the finding in a previous study that musical characteristics differ depending on their context (Mehr et al., Reference Mehr, Singh, Knox, Ketter, Pickens-Jones, Atwood, Lucas, Jacoby, Egner, Hopkins, Howard, Hartshorne, Jennings, Simson, Bainbridge, Pinker, O’Donnell, Krasnow and Glowacki2019), suggesting the importance of music’s social function. Nishikawa and Ihara (Reference Nishikawa and Ihara2022) also found that songs sung in work-related contexts (i.e. work songs) tend to vary more between regions than child, ritual, or amusement songs. In contrast, the effect of difference in recording years was not significant in any cases. Although the difference in recording years between melodies was 52 years at maximum, this may be too short for any changes in the melodies of folk songs to be detected, considering the fact that it is getting easier to listen to and imitate older performances of folk songs because of the recent developments in recording technology, as Pendlebury (Reference Pendlebury2020) pointed out. A caveat to our interpretation of the LMM results is that the model does not include the divergence time between pairs of melodies as an independent variable; thus, it could be that pairs of sister songs sung in different islands or different social contexts tend to be dissimilar because they tend to have diverged earlier than other pairs of sister songs, a possibility that seems to us unlikely, but cannot be tested with the current data. Note that whereas melodies in some song groups might change more rapidly or slowly than those in other song groups depending on, for example, the number of notes with strong rhythmic functions, the resulting variance in melodic similarity between song groups is taken into consideration by the incorporation of the random effect of song groups.

In conclusion, our quantitative analysis on folk songs in the Ryukyu Archipelago suggests that diversification of sister songs derived from a common origin is promoted when one of them is transmitted from one island to another, and when one of them undergoes a change of the social context in which it is sung. Our conclusion is based only on the difference in melody between songs, and future studies should also consider other aspects of song variation, most notably the difference in lyrics between songs, to further explore the factors affecting cultural evolution of folk songs.

Supplementary material

The supplementary material for this article can be found at https://doi.org/10.1017/ehs.2025.10010.

Acknowledgements

The authors would like to thank all the singers, researchers, government officials, and other local people who helped us with song recordings at Amami-Oshima, Okinawa, Miyako, and Ishigaki islands. The authors would also like to thank Patrick Savage for providing R source code to calculate PID between melodies. DeepL (https://www.deepl.com/) was partially used to check English wording in the manuscript.

Author contributions

YN and YI conceived and designed the study. YN conducted data gathering. YN performed statistical analyses. YN and YI wrote the article.

Financial support

This work was supported by Yamaha Music Foundation (YI) and KAKENHI (YI, grant number JP17H06381).

Conflicts of interest

YN and YI declare none.

Research transparency and reproducibility

Song data are available from Supplementary material. R source code to calculate PID between melodies is available at https://github.com/pesavage/melodic-evolution.

References

Bayard, S. P. (1950). Prolegomena to a study of the principal melodic families of British–American folk song. The Journal of American Folklore, 63(247), 1. https://doi.org/10.2307/537347CrossRef Google Scholar

Bountouridis, D., Brown, D., Wiering, F., & Veltkamp, R. (2017). Melodic similarity and applications using biologically-inspired techniques. Applied Sciences, 7(12), 1242. https://doi.org/10.3390/app7121242CrossRef Google Scholar

Brunner, E., & Munzel, U. (2000). The nonparametric Behrens–Fisher problem: asymptotic theory and a small-sample approximation. Biometrical Journal, 42(1), 17–25. https://doi.org/10.1002/(SICI)1521-4036(200001)42:1<17::AID-BIMJ17>3.0.CO;2-U3.0.CO;2-U>CrossRef Google Scholar

Chiang, T.-Y., & Schaal, B. A. (2006). Phylogeography of plants in Taiwan and the Ryukyu Archipelago. Taxon, 55(1), 31–41. https://doi.org/10.2307/25065526CrossRef Google Scholar

Hillewaere, R., Manderick, B., & Conklin, D. (2014). Alignment methods for folk tune classification. In Spiliopoulou, M., Schmidt-Thieme, L., & Janning, R. Eds., Data analysis, machine learning and knowledge discovery (369–377). Springer. https://doi.org/10.1007/978-3-319-01595-8_40CrossRef Google Scholar

Holland, B. R., Huber, K. T., Dress, A., & Moulton, V. (2002). δ Plots: a tool for analyzing phylogenetic distance data. Molecular Biology and Evolution, 19(12), 2051–2059. https://doi.org/10.1093/oxfordjournals.molbev.a004030CrossRef Google Scholar PubMed

Huson, D. H., & Bryant, D. (2006). Application of phylogenetic networks in evolutionary studies. Molecular Biology and Evolution, 23(2), 254–267. https://doi.org/10.1093/molbev/msj030CrossRef Google Scholar PubMed

Janssen, B., van Kranenburg, P., & Volk, A. (2017). Finding occurrences of melodic segments in folk songs employing symbolic similarity measures. Journal of New Music Research, 46(2), 118–134. https://doi.org/10.1080/09298215.2017.1316292CrossRef Google Scholar

Koizumi, F. (1958). Nihon dento ongaku no kenkyu 1. Ongaku no tomo sha. In Japanese.Google Scholar

Kuznetsova, A., Brockhoff, P. B., & Christensen, R. H. B. (2017). lmerTest Package: tests in linear mixed effects models. Journal of Statistical Software, 82(13), 1–26. https://doi.org/10.18637/jss.v082.i13CrossRef Google Scholar

Kuznetsova, A., Christensen, R. H. B., Bavay, C., & Brockhoff, P. B. (2015). Automated mixed ANOVA modeling of sensory and consumer data. Food Quality and Preference, 40(PA), 31–38. https://doi.org/10.1016/j.foodqual.2014.08.004CrossRef Google Scholar

Mehr, S. A., Singh, M., Knox, D., Ketter, D. M., Pickens-Jones, D., Atwood, S., Lucas, C., Jacoby, N., Egner, A. A., Hopkins, E. J., Howard, R. M., Hartshorne, J. K., Jennings, M. V., Simson, J., Bainbridge, C. M., Pinker, S., O’Donnell, T. J., Krasnow, M. M., & Glowacki, L. (2019). Universality and diversity in human song. Science, 366(6468), eaax0868. https://doi.org/10.1126/science.aax0868CrossRef Google Scholar PubMed

Mongeau, M., & Sankoff, D. (1990). Comparison of musical sequences. Computers and the Humanities, 24(3), 161–175. https://doi.org/10.1007/BF00117340CrossRef Google Scholar

Mora, J., Gómez, F., Gómez, E., & Díaz-Báñez, J. M. (2016). Melodic contour and mid-level global features applied to the analysis of Flamenco Cantes. Journal of New Music Research, 45(2), 145–159. https://doi.org/10.1080/09298215.2016.1174717CrossRef Google Scholar

Motokawa, M. (2000). Biogeography of living mammals in the Ryukyu Islands. Tropics, 10(1), 63–71. https://doi.org/10.3759/tropics.10.63CrossRef Google Scholar

Nippon Hoso Kyokai [NHK]. (1989–1993). A survey of Japanese folksongs – Okinawa-Amami Islands. NHK Publishing. In Japanese.Google Scholar

Nishikawa, Y., & Ihara, Y. (2022). Cultural transmission of traditional songs in the Ryukyu Archipelago. Public Library of Science ONE, 17(6), e0270354. https://doi.org/10.1371/journal.pone.0270354Google Scholar PubMed

Ota, H. (1998). Geographic patterns of endemism and speciation in amphibians and reptiles of the Ryukyu Archipelago, Japan, with special reference to their paleogeographical implications. Researches on Population Ecology, 40(2), 189–204. https://doi.org/10.1007/BF02763404CrossRef Google Scholar

Passmore, S., Wood, A. L. C., Barbieri, C., Shilton, D., Daikoku, H., Atkinson, Q. D., & Savage, P. E. (2024). Global musical diversity is largely independent of linguistic and genetic histories. Nature Communications, 15(1), 3964. https://doi.org/10.1038/s41467-024-48113-7CrossRef Google Scholar PubMed

Pellard, T. (2015). The linguistic archeology of the Ryukyu Islands. In Heinrich, P., Miyara, S., & Shimoji, M. (Eds.), Handbook of the Ryukyuan languages: history, structure, and use (13–37). De Gruyter Mouton.CrossRef Google Scholar

Pendlebury, C. (2020). Tune families and tune histories: melodic resemblances in British and Irish folk tunes. Folk Music Journal, 11(5), 67–95. https://www.jstor.org/stable/45280996 Google Scholar

Savage, P. E. (2019). Cultural evolution of music. Palgrave Communications, 5(1), 16. https://doi.org/10.1057/s41599-019-0221-1CrossRef Google Scholar

Savage, P. E., & Atkinson, Q. D. (2015). Automatic tune family identification by musical sequence alignment. In Müller, M., & Wiering, F. (Eds.), Proceedings of the 16th International Society for Music Information Retrieval Conference (162–168).Google Scholar

Savage, P. E., Merritt, E., Rzeszutek, T., & Brown, S. (2012). CantoCore: a new cross-cultural song classification scheme. Analytical Approach to World Music, 2, 87–137. https://journal.iftawm.org/previous/vol2no1/savage-merritt-rzeszutek-brown/.Google Scholar

Savage, P. E., Passmore, S., Chiba, G., Currie, T. E., Suzuki, H., & Atkinson, Q. D. (2022). Sequence alignment of folk song melodies reveals cross-cultural regularities of musical evolution. Current Biology, 32(6), . https://doi.org/10.1016/j.cub.2022.01.039CrossRef Google Scholar PubMed

Thompson, J. D., Higgins, D. G., & Gibson, T. J. (1994). CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Research, 22(22), 4673–4680. https://doi.org/10.1093/nar/22.22.4673CrossRef Google Scholar PubMed

Uchida, R. (1983). Amami minyo to sono shuhen. Yuzankaku. In Japanese.Google Scholar

van Kranenburg, P., Volk, A., & Wiering, F. (2013). A comparison between global and local features for computational classification of folk song melodies. Journal of New Music Research, 42(1), 1–18. https://doi.org/10.1080/09298215.2012.718790CrossRef Google Scholar

van Kranenburg, P., Volk, A., Wiering, F., & Veltkamp, R. C. (2009). Musical models for folk-song melody alignment. Proceedings of the 10th International Society for Music Information Retrieval Conference ().Google Scholar

Wood, A. L. C., Kirby, K. R., Ember, C. R., Silbert, S., Passmore, S., Daikoku, H., McBride, J., Paulay, F., Flory, M. J., Szinger, J., D’Arcangelo, G., Bradley, K. K., Guarino, M., Atayeva, M., Rifkin, J., Baron, V., El Hajli, M., Szinger, M., & Savage, P. E. (2022). The Global Jukebox: a public database of performing arts and culture. Public Library of Science ONE, 17(11), e0275469. https://doi.org/10.1371/journal.pone.0275469Google Scholar PubMed

Yamanouchi, S. (1959). The history of musical culture in Ryūkyū. Minzoku geino zenshu kako kai. In Japanese.Google Scholar

Yanagita, K. (1940). Minyo oboegaki. Sogen sha. Japanese.Google Scholar

Figure 1. Four types of scales that constitute traditional Japanese songs advocated by Koizumi (1958). (a) The Ryukyu scale. (b) The ritsu scale. (c) The minyo scale. (d) The miyako-bushi scale. Notes indicated in white are considered to be important as ‘nuclear tones’. Based on the figures by Koizumi (1958) and NHK (1989–1993).

Figure 3. (a) Letters assigned to the 12 pitch classes. (b) Example of alignment of a pair of melodies using parameter set 2.

Table 1. List of the independent variables of LMM

Table 2. Results of LMM analysis for all melodies. Model (2) with standardized variables and parameter set 2

Table 3. Results of LMM analysis for all melodies. Model (2) with non-standardized variables and parameter set 2

Table 4. Results of LMM analysis for melodies from SJF. Model (3) with standardized variables and parameter set 2

Nishikawa and Ihara supplementary material 1

Nishikawa and Ihara supplementary material

File 32.9 KB

Nishikawa and Ihara supplementary material 2

Nishikawa and Ihara supplementary material

File 1.4 MB

Article contents

Exploring factors for melodic diversification of folk songs in the Ryukyu Archipelago

Abstract

Keywords

Information

Social media summary

1. Introduction

2. Materials and methods

2.1. Data

2.2. Coding

2.3. Alignment and similarity between melodies

2.4. Neighbor-Net

2.5. Linear mixed model

3. Results

3.1. Similarity between melodies

3.2. Neighbor-Net

3.3. Linear mixed model

4. Discussion

Supplementary material

Acknowledgements

Author contributions

Financial support

Conflicts of interest

Research transparency and reproducibility

References

Nishikawa and Ihara supplementary material 1

Nishikawa and Ihara supplementary material 2

Save article to Kindle

Save article to Dropbox

Save article to Google Drive

Reply to: Submit a response

Your details

You have entered the maximum number of contributors

Conflicting interests