Abstract:Precise sound collection and monitoring is the foundation of Soundscape Ecology. However, there is an urgent need to address the lack of systematic summary of spatio-temporal scale selection in Soundscape Ecology. The monitoring spatio-temporal scale of Soundscape Ecology is closely related to its subsequent research conclusions. Choosing different monitoring spatio-temporal scales has a significant impact on the research conclusions. Therefore, choosing a reasonable monitoring spatio-temporal scale is crucial for precisely reflecting the soundscape dynamic and biodiversity. We reviewed and summarized the spatial and temporal scales of Soundscape Ecology and their selection based on 86 research articles from the Core Collection of the Web of Science database between 2011 and 2023. In our review, we extracted and classified the information of spatio-temporal scales, including research objectives, sampling temporal schedule (continuous or interval recording), temporal length, spatial placement, spatial minimum distance (in meters) between devices, and spatial average surface height (in meters) of equipment. With respect to spatial scale, the placements were categorized into partition, grid, transect, random and special placement method. The choice of placement method depended on the application range and spatial scale, with the partition and grid placement method were the most commonly used in Soundscape Ecology. The most commonly observed minimum distance between devices was 200 meters, but in order to achieve the best monitoring performance of the equipment, it is necessary to obtain the monitoring range of the equipment based on different models of recording equipment and different environmental factors. The average surface height of the equipment was most commonly 1.5 meters, with lower surface heights suitable for exploring the soundscape of insect communities near ground shrubs and higher surface heights suitable for recording avian acoustic activities in tree canopies. In terms of temporal scale, sampling schedules were divided into continuous recording and interval recording. Continuous recording was suitable for collecting short-term and refined temporal scale data, such as capturing dawn or dusk chorus changes within 4 hours or diel soundscape changes within 72 hours. Triggered continuous recording was suitable for collecting long-term acoustic information of specific vocal species. Interval recording was suitable for capturing long-term acoustic pattern changes, with intervals of 1 in 10 minutes and 1 in 15 minutes being the most commonly used, accounting for 12.3% and 7.4%, respectively. Regarding scale selection, research objectives could be grouped into three categories: studying soundscape pattern and their driving factors, evaluating ecological environment quality, and evaluating biodiversity and its acoustic characteristics. Different soundscape ecological processes exhibited significant differences in spatio-temporal scales selection, but there were also similar selection trends within the same process category. For example, studying soundscape patterns and their driving factors often involves interval recording and grid placement methods, while evaluating ecological environment quality often involves interval recording and transect placement methods. Evaluating biodiversity and its acoustic characteristics often requires continuous recording and random placement methods. The partition placement method was widely used across various research objectives. At present, Soundscape Ecology requires answering three questions of dependence between spatial scale and ecological hierarchy structure, temporal scale conversion and its conversion accuracy, and reasonable selection of spatio-temporal scale. We encourage future research to focus on building spatial-dimensional monitoring network, conducting ecology multi-scale simulation study, and strengthening quantitative evaluation of scale selection.