Hector Galaxy Survey: Data processing, quality control, and early science

Sree Oh; Madusha Gunawardhana; Scott Croom; Gabriella Quattropani; Sujeeporn Tuntipong; Julia Bryant; Pablo Corcho Caballero; Pratyush Kumar Das; Oğuzhan Çakır; Joon Hyeop Lee; A. Ristea; Stefania Barsanti; Mina Pak; Sarah Sweet; Tom Woodrow; Thomas Rutherford; Yifan Mai; Matt Owers; Matthew Colless; Lachlan Stuart; Henry R. M. Zovaro; Sam Vaughan; Jesse van de Sande; Tony Farrell; Minje Beom; Joss J. Bland-Hawthorn; Jiwon Chung; Caroline Foster; Kathryn Grasha; Hyunjin Jeong; Jong Chul Lee; Anilkumar Mailvaganam; Kyuseok Oh; Simon O’Toole; Edward N. Taylor; Tayyaba Zafar; Gurashish Bhatia; David Brodrick; Rebecca Brown; Elton Cheng; Robert Content; Fred Crous; Peter Gillingham; Ellen Houston; Jon Lawrence; Helen McGregor; Mahesh Mohanan; Seong-sik Min; Barnaby Norris; Naveen Pai; Ayoan Sadman; Will Saunders; Adeline Wang; Ross Zhelem; Jessica Zheng

doi:10.1017/pasa.2025.10106

Hector Galaxy Survey: Data processing, quality control, and early science

Published online by Cambridge University Press: 09 October 2025

Sree Oh*: Affiliation:
Department of Astronomy and Yonsei University Observatory, Yonsei University, Seoul, Republic of Korea
Madusha Gunawardhana*: Affiliation:
Sydney Institute for Astronomy (SIfA), School of Physics, The University of Sydney, Sydney, NSW, Australia Research School of Astronomy and Astrophysics, Australian National University, Canberra, ACT, Australia ARC Centre of Excellence for All Sky Astrophysics in 3 Dimensions (ASTRO 3D), Australia
Scott Croom: Affiliation:
Sydney Institute for Astronomy (SIfA), School of Physics, The University of Sydney, Sydney, NSW, Australia ARC Centre of Excellence for All Sky Astrophysics in 3 Dimensions (ASTRO 3D), Australia
Gabriella Quattropani: Affiliation:
ARC Centre of Excellence for All Sky Astrophysics in 3 Dimensions (ASTRO 3D), Australia School of Mathematical and Physical Sciences, Macquarie University, Sydney, NSW, Australia Astrophysics and Space Technologies Research Centre, Macquarie University, Sydney, NSW, Australia
Sujeeporn Tuntipong: Affiliation:
Sydney Institute for Astronomy (SIfA), School of Physics, The University of Sydney, Sydney, NSW, Australia ARC Centre of Excellence for All Sky Astrophysics in 3 Dimensions (ASTRO 3D), Australia
Julia Bryant: Affiliation:
Sydney Institute for Astronomy (SIfA), School of Physics, The University of Sydney, Sydney, NSW, Australia Astralis-USyd, Sydney Institute for Astronomy, School of Physics, The University of Sydney, Sydney, NSW, Australia
Pablo Corcho Caballero: Affiliation:
Kapteyn Astronomical Institute, University of Groningen, AV Groningen, The Netherlands
Pratyush Kumar Das: Affiliation:
School of Mathematics and Physics, University of Queensland, Brisbane, QLD, Australia
Oğuzhan Çakır: Affiliation:
ARC Centre of Excellence for All Sky Astrophysics in 3 Dimensions (ASTRO 3D), Australia School of Mathematical and Physical Sciences, Macquarie University, Sydney, NSW, Australia Astrophysics and Space Technologies Research Centre, Macquarie University, Sydney, NSW, Australia
Joon Hyeop Lee: Affiliation:
Korea Astronomy and Space Science Institute (KASI), Yuseong-gu, Daejeon, Republic of Korea
A. Ristea: Affiliation:
ARC Centre of Excellence for All Sky Astrophysics in 3 Dimensions (ASTRO 3D), Australia International Centre for Radio Astronomy Research, The University of Western Australia, Crawley, WA, Australia
Stefania Barsanti: Affiliation:
Sydney Institute for Astronomy (SIfA), School of Physics, The University of Sydney, Sydney, NSW, Australia Research School of Astronomy and Astrophysics, Australian National University, Canberra, ACT, Australia
Mina Pak: Affiliation:
ARC Centre of Excellence for All Sky Astrophysics in 3 Dimensions (ASTRO 3D), Australia School of Mathematical and Physical Sciences, Macquarie University, Sydney, NSW, Australia Korea Astronomy and Space Science Institute (KASI), Yuseong-gu, Daejeon, Republic of Korea
Sarah Sweet: Affiliation:
ARC Centre of Excellence for All Sky Astrophysics in 3 Dimensions (ASTRO 3D), Australia School of Mathematics and Physics, University of Queensland, Brisbane, QLD, Australia
Tom Woodrow: Affiliation:
Siding Spring Observatory, Research School of Astronomy and Astrophysics, Australian National University, Canberra, ACT, Australia
Thomas Rutherford: Affiliation:
Sydney Institute for Astronomy (SIfA), School of Physics, The University of Sydney, Sydney, NSW, Australia European Southern Observatory, Garching, Germany
Yifan Mai: Affiliation:
Sydney Institute for Astronomy (SIfA), School of Physics, The University of Sydney, Sydney, NSW, Australia ARC Centre of Excellence for All Sky Astrophysics in 3 Dimensions (ASTRO 3D), Australia Australian Astronomical Optics, Macquarie University, Sydney, NSW, Australia
Matt Owers: Affiliation:
ARC Centre of Excellence for All Sky Astrophysics in 3 Dimensions (ASTRO 3D), Australia School of Mathematical and Physical Sciences, Macquarie University, Sydney, NSW, Australia Astrophysics and Space Technologies Research Centre, Macquarie University, Sydney, NSW, Australia
Matthew Colless: Affiliation:
Research School of Astronomy and Astrophysics, Australian National University, Canberra, ACT, Australia
Lachlan Stuart: Affiliation:
Sydney Institute for Astronomy (SIfA), School of Physics, The University of Sydney, Sydney, NSW, Australia
Henry R. M. Zovaro: Affiliation:
Research School of Astronomy and Astrophysics, Australian National University, Canberra, ACT, Australia
Sam Vaughan: Affiliation:
School of Mathematical and Physical Sciences, Macquarie University, Sydney, NSW, Australia Astronomy, Astrophysics and Astrophotonics Research Centre, Macquarie University, Sydney, NSW, Australia Centre for Astrophysics and Supercomputing, School of Science, Swinburne University of Technology, Hawthorn, VIC, Australia
Jesse van de Sande: Affiliation:
Sydney Institute for Astronomy (SIfA), School of Physics, The University of Sydney, Sydney, NSW, Australia ARC Centre of Excellence for All Sky Astrophysics in 3 Dimensions (ASTRO 3D), Australia School of Physics, University of New South Wales, Sydney, NSW, Australia
Tony Farrell: Affiliation:
Astralis-AAO, Australian Astronomical Optics, Faculty of Science and Engineering, Macquarie University, Sydney, NSW, Australia
Minje Beom: Affiliation:
Korea Astronomy and Space Science Institute (KASI), Yuseong-gu, Daejeon, Republic of Korea
Joss J. Bland-Hawthorn: Affiliation:
Sydney Institute for Astronomy (SIfA), School of Physics, The University of Sydney, Sydney, NSW, Australia ARC Centre of Excellence for All Sky Astrophysics in 3 Dimensions (ASTRO 3D), Australia
Jiwon Chung: Affiliation:
Korea Astronomy and Space Science Institute (KASI), Yuseong-gu, Daejeon, Republic of Korea Institute for Data Innovation in Science, Seoul National University, Seoul, Republic of Korea
Caroline Foster: Affiliation:
School of Physics, University of New South Wales, Sydney, NSW, Australia
Kathryn Grasha: Affiliation:
Research School of Astronomy and Astrophysics, Australian National University, Canberra, ACT, Australia ARC Centre of Excellence for All Sky Astrophysics in 3 Dimensions (ASTRO 3D), Australia
Hyunjin Jeong: Affiliation:
Korea Astronomy and Space Science Institute (KASI), Yuseong-gu, Daejeon, Republic of Korea
Jong Chul Lee: Affiliation:
Korea Astronomy and Space Science Institute (KASI), Yuseong-gu, Daejeon, Republic of Korea
Anilkumar Mailvaganam: Affiliation:
ARC Centre of Excellence for All Sky Astrophysics in 3 Dimensions (ASTRO 3D), Australia School of Mathematical and Physical Sciences, Macquarie University, Sydney, NSW, Australia Astrophysics and Space Technologies Research Centre, Macquarie University, Sydney, NSW, Australia
Kyuseok Oh: Affiliation:
Korea Astronomy and Space Science Institute (KASI), Yuseong-gu, Daejeon, Republic of Korea
Simon O’Toole: Affiliation:
Astrophysics and Space Technologies Research Centre, Macquarie University, Sydney, NSW, Australia Australian Astronomical Optics, Macquarie University, Sydney, NSW, Australia
Edward N. Taylor: Affiliation:
Centre for Astrophysics and Supercomputing, School of Science, Swinburne University of Technology, Hawthorn, VIC, Australia
Tayyaba Zafar: Affiliation:
School of Mathematical and Physical Sciences, Macquarie University, Sydney, NSW, Australia Astrophysics and Space Technologies Research Centre, Macquarie University, Sydney, NSW, Australia
Gurashish Bhatia: Affiliation:
Astralis-USyd, Sydney Institute for Astronomy, School of Physics, The University of Sydney, Sydney, NSW, Australia
David Brodrick: Affiliation:
Research School of Astronomy and Astrophysics, Australian National University, Canberra, ACT, Australia
Rebecca Brown: Affiliation:
Astralis-USyd, Sydney Institute for Astronomy, School of Physics, The University of Sydney, Sydney, NSW, Australia
Elton Cheng: Affiliation:
Astralis-USyd, Sydney Institute for Astronomy, School of Physics, The University of Sydney, Sydney, NSW, Australia
Robert Content: Affiliation:
Astralis-AAO, Australian Astronomical Optics, Faculty of Science and Engineering, Macquarie University, Sydney, NSW, Australia
Fred Crous: Affiliation:
Astralis-USyd, Sydney Institute for Astronomy, School of Physics, The University of Sydney, Sydney, NSW, Australia
Peter Gillingham: Affiliation:
Astralis-AAO, Australian Astronomical Optics, Faculty of Science and Engineering, Macquarie University, Sydney, NSW, Australia
Ellen Houston: Affiliation:
Astralis-AAO, Australian Astronomical Optics, Faculty of Science and Engineering, Macquarie University, Sydney, NSW, Australia
Jon Lawrence: Affiliation:
Astralis-AAO, Australian Astronomical Optics, Faculty of Science and Engineering, Macquarie University, Sydney, NSW, Australia
Helen McGregor: Affiliation:
Astralis-AAO, Australian Astronomical Optics, Faculty of Science and Engineering, Macquarie University, Sydney, NSW, Australia
Mahesh Mohanan: Affiliation:
Astralis-AAO, Australian Astronomical Optics, Faculty of Science and Engineering, Macquarie University, Sydney, NSW, Australia
Seong-sik Min: Affiliation:
Astralis-USyd, Sydney Institute for Astronomy, School of Physics, The University of Sydney, Sydney, NSW, Australia
Barnaby Norris: Affiliation:
Sydney Institute for Astronomy (SIfA), School of Physics, The University of Sydney, Sydney, NSW, Australia Astralis-USyd, Sydney Institute for Astronomy, School of Physics, The University of Sydney, Sydney, NSW, Australia
Naveen Pai: Affiliation:
Astralis-AAO, Australian Astronomical Optics, Faculty of Science and Engineering, Macquarie University, Sydney, NSW, Australia
Ayoan Sadman: Affiliation:
Astralis-USyd, Sydney Institute for Astronomy, School of Physics, The University of Sydney, Sydney, NSW, Australia
Will Saunders: Affiliation:
Astralis-AAO, Australian Astronomical Optics, Faculty of Science and Engineering, Macquarie University, Sydney, NSW, Australia
Adeline Wang: Affiliation:
Astralis-USyd, Sydney Institute for Astronomy, School of Physics, The University of Sydney, Sydney, NSW, Australia
Ross Zhelem: Affiliation:
Astralis-AAO, Australian Astronomical Optics, Faculty of Science and Engineering, Macquarie University, Sydney, NSW, Australia
Jessica Zheng: Affiliation:
Astralis-AAO, Australian Astronomical Optics, Faculty of Science and Engineering, Macquarie University, Sydney, NSW, Australia
*: Corresponding authors:Sree Oh; Email: sreemario@gmail.com; Madusha Gunawardhana; Email: madusha.gunawardhana@sydney.edu.au
Corresponding authors:Sree Oh; Email: sreemario@gmail.com; Madusha Gunawardhana; Email: madusha.gunawardhana@sydney.edu.au

Article contents

Abstract
Introduction
Data processing and quality control
Verification of early science data
Summary and conclusions
Data availability statement
Author contributions
Funding statement
Footnotes
References

Rights & Permissions

Abstract

The Hector Galaxy Survey is a new optical integral field spectroscopy (IFS) survey currently using the Anglo-Australian Telescope to observe up to 15 000 galaxies at low redshift ($z \lt 0.1$). The Hector instrument employs 21 optical fibre bundles feeding into two double-beam spectrographs, AAOmega and the new Spector spectrograph, to enable wide-field multi-object IFS observations of galaxies. To efficiently process the survey data, we adopt the data reduction pipeline developed for the SAMI Galaxy Survey, with significant updates to accommodate Hector’s dual-spectrograph system. These enhancements address key differences in spectral resolution and other instrumental characteristics relative to SAMI and are specifically optimised for Hector’s unique configuration. We introduce a two-dimensional arc fitting approach that reduces the root-mean-square (RMS) velocity scatter by a factor of 1.2–3.4 compared to fitting arc lines independently for each fibre. The pipeline also incorporates detailed modelling of chromatic optical distortion in the wide-field corrector, to account for wavelength-dependent spatial shifts across the focal plane. We assess data quality through a series of validation tests, including wavelength solution accuracy (1.2–2.7 km s$^{-1}$ RMS), spectral resolution (FWHM of 1.2–1.4 Å for Spector), throughput characterisation, astrometric precision ($\lesssim$ 0.03 arcsec median offset), sky subtraction residuals (1–1.6% median continuum residual), and flux calibration stability (4% systematic offset when compared to Legacy Survey fluxes). We demonstrate that Hector delivers high-fidelity, science-ready datasets, supporting robust measurements of galaxy kinematics, stellar populations, and emission-line properties and provide examples. Additionally, we address systematic uncertainties identified during the data processing and propose future improvements to enhance the precision and reliability of upcoming data releases. This work establishes a robust data reduction framework for Hector, delivering high-quality data products that support a broad range of extragalactic studies.

Keywords

Galaxies: general astronomical data bases: surveys instrumentation: spectrographs techniques: imaging spectroscopy methods: data analysis

Information

Type: Research Article
Information: Publications of the Astronomical Society of Australia , Volume 42 , 2025 , e150

DOI: https://doi.org/10.1017/pasa.2025.10106 [Opens in a new window]

NASA ADS Abstract Service [Opens in a new window]
Creative Commons: This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright: © The Author(s), 2025. Published by Cambridge University Press on behalf of Astronomical Society of Australia

1. Introduction

Integral field spectroscopy (IFS) has transformed our understanding of galaxies by efficiently enabling spatially resolved studies of their internal structures, dynamics, and star formation processes. For comprehensive reviews, see Cappellari (Reference Cappellari2016) and Sánchez (Reference Sánchez2020). Over the past two decades, several pioneering IFS surveys, including the SAURON project (Bacon et al. Reference Bacon2001), ATLAS3D (Cappellari et al. Reference Cappellari2011), the CALIFA survey (Sánchez et al. Reference Sánchez2012), the SAMI Galaxy Survey (Croom et al. Reference Croom2012; Bryant et al. Reference Bryant2015), the MaNGA survey (Bundy et al. Reference Bundy2015), the KMOS $^\textrm{3D}$ survey (Wisnioski et al. Reference Wisnioski2015), and the MAGPI Survey (Foster et al. Reference Foster2021), have provided valuable insights into the evolution of galaxies across a wide range of masses and morphologies. The Hector Galaxy Survey (Bryant et al. Reference Bryant, Bryant, Motohara and Vernet2024, Bryant et al. in preparation) builds on the success of its predecessor, the SAMI Galaxy Survey, expanding its scope to encompass a larger and more diverse sample of up to 15 000 galaxies at $z\lt 0.1$ , including low-mass galaxies and blue galaxies in dense environments that were underrepresented in previous large IFS surveys. With enhanced spatial coverage, higher spectral resolution in the new spectrograph, a wider field of view, and an upgraded data reduction pipeline, the Hector Galaxy Survey aims to address critical questions in galaxy evolution, including the role of environment in the build-up of angular momentum, the nature of low-mass galaxies, gas feeding and feedback processes, and how these factors influence star formation.

IFS data are inherently complex and require robust data reduction pipelines to produce accurate and reliable three-dimensional $(x, y, \lambda)$ data cubes. The Hector data reduction pipeline builds on the framework established by the SAMI reduction pipeline (Allen et al. Reference Allen2014), which was initially developed using algorithms from Sharp et al. (Reference Sharp2015). The SAMI pipeline has been continuously refined and enhanced through successive data releases, with significant contributions from Allen et al. (Reference Allen2015), Green et al. (Reference Green2018), Scott et al. (Reference Scott2018), and Croom et al. (Reference Croom2021). Building on this well-established framework, the Hector data reduction pipeline incorporates several improvements to address the increased complexity of Hector’s data. Hector uses state-of-the-art hexabundles (Bland-Hawthorn et al. Reference Bland-Hawthorn2011; Bryant et al. Reference Bryant, Bland-Hawthorn, Fogarty, Lawrence and Croom2014; Brown et al. Reference Brown, Wang, Bryant, Leon-Saval, Navarro and Geyl2018; Wang et al. Reference Wang, Brown, Bryant, Leon-Saval, Barto, Breckinridge and Stahl2019, Reference Wang, Brown, Bryant, Leon-Saval, Evans, Bryant and Motohara2020, Reference Wang, Brown, Bryant and Leon-Saval2023) to simultaneously collect spatially resolved spectra from 21 objects across a 2-degree field, with two bundles specifically dedicated to simultaneous flux calibration using secondary standard stars. Unlike SAMI, which used fixed-diameter 15.7 arcsec hexabundles, Hector features hexabundles of varying sizes (12 to 26 arcsec diameter), providing greater flexibility in spatial sampling and aiming to cover at least 2 effective radii for 70% of target galaxies. These hexabundles feed into two dual-arm spectrographs: 8 hexabundles connect to the original AAOmega (Sharp et al. Reference Sharp2006), previously employed for SAMI, while the remaining 13 feed into the newly developed Spector spectrograph (Bryant et al. Reference Bryant, Bryant, Motohara and Vernet2024). Spector provides higher spectral resolution, improved throughput, and broader wavelength coverage compared to AAOmega. On the other hand, AAOmega features larger bundle sizes, which are advantageous for observing galaxies with larger angular size. As a result, the data reduction pipeline requires substantial revisions for the Hector Galaxy Survey, particularly to accommodate and process the data from both instruments effectively.

Hector was commissioned on the Anglo-Australian Telescope (AAT) in 2022 and the galaxy survey officially commenced in 2023. Hector galaxy targets are selected from the Wide Area VISTA Extra-Galactic Survey (WAVES) region (Driver et al. Reference Driver, Napolitano, Longo, Marconi, Paolillo and Iodice2016; Kaur et al. Reference Kaur, Bilicki and Hellwing2025), with additional cluster galaxies included (Owers et al. in preparation). They span the redshift and mass ranges $0\lt z\lt 0.1$ and $10^7\lt M_*/M_{\odot}\lt 10^{12}$ and are distributed following a stepped selection in the stellar mass–redshift plane, similar to the approach used for the SAMI survey (Bryant et al. Reference Bryant2015). Detailed information on the observed galaxies and the target selection strategy is provided in the forthcoming Hector target selection paper (Barsanti et al. in preparation).

In this paper, we provide an overview of the data reduction processes and a comprehensive verification of the Hector early science dataset, comprising data cubes for 1 539 unique galaxies that incorporate observations from 13 observing runs conducted during 2023 and 2024. The structure of this paper is as follows. In Section 2, we describe the data processing and quality control measures. Particular attention is given to key enhancements in the reduction pipeline, including the integration of Spector data, the adoption of a two-dimensional modelling approach for wavelength solutions, and the correction of chromatic optical distortion in the 2-degree Field (2dF; Lewis et al. Reference Lewis2002) corrector. In Section 3, we verify the quality of the early science data through assessments of signal-to-noise (S/N), spatial and spectral resolution, and World Coordinate System (WCS) accuracy. Additionally, we present example spectra, kinematic maps, and emission-line maps to demonstrate that the early science dataset is ready for scientific analysis. We summarise the paper in Section 4. Throughout the paper we assume a cosmology with $\Omega_\textrm{m} = 0.3$ , $\Omega_{\Lambda} = 0.7$ , and $H_0 = 70$ km s $^{-1}$ Mpc $^{-1}$ .

Table 1. A summary of Hector spectral resolution at the central wavelengths $\lambda_{central}$ . This table provides data for all four CCDs including wavelength coverage ( $\lambda_{range}$ ) in Å, central wavelength $\lambda_{central}$ in Å, median FWHM of best-fit Gaussian to the instrumental LSF (FWHM) in Å, median standard deviation of the Gaussian fit ( $\sigma$ ) in Å, spectral resolution at $\lambda_{central}$ ( $R_{\lambda_{central}}$ ), velocity resolution (FWHM) in km s $^{-1}$ , and dispersion resolution ( $1\sigma$ ) in km s $^{-1}$ .

2. Data processing and quality control

The Hector data reduction pipeline has been refined to accommodate the advanced features and complexities of the Hector instrument. Hector employs two dual-arm spectrographs, simultaneously collecting data from four CCDs: AAOmega blue (CCD1), AAOmega red (CCD2), Spector blue (CCD3), and Spector red (CCD4). Among these, the newly developed Spector spectrograph introduces several differences compared to the AAOmega spectrograph. Notably, the Spector CCDs feature larger format detectors (4 096 $\times$ 4 112 pixels), enabling finer spectral resolutions compared to the AAOmega CCDs (2 048 $\times$ 4 096 pixels). Additionally, Spector provides broader wavelength coverage, effectively bridging the gap between the blue and red arm data observed in AAOmega, thus enhancing its overall spectral capabilities. For example, Spector includes important spectral features such as the Na d lines at 5 890/5 896 Å. See Table 1 for a summary of the wavelength coverage and spectral resolution of the two spectrographs.

In this section, we provide an overview of the data processing, highlight the improvements introduced in the pipeline, and assess the outcomes. The data processing for Hector has three main stages: data reduction, flux calibration, and data cube generation. Although the entire process is managed by the Hector reduction package, written in Python, it primarily acts as a wrapper for data reduction, invoking 2dFdr multi-fibre reduction pipelineFootnote ^a AAO Software Team (2015). The subsequent steps, including flux calibration and cubing, are handled directly by the Hector reduction package.

2.1 Data reduction

The data reduction, predominantly executed by 2dFdr, involves several essential tasks: adjusting for background signals, mapping and extracting spectra from individual fibres, calibrating wavelengths, correcting illumination discrepancies, and removing sky background. In this section, we detail our data reduction strategy and assess the output quality. Specifically, we highlight a new method for deriving accurate wavelength solutions through two-dimensional (2D) modelling of arc frames.

2.1.1 Overscan and bias corrections

2dFdr applies bias subtraction and corrects for pixel-to-pixel sensitivity. The bias level is computed by fitting the overscan region of CCD1 with a combined exponential and polynomial function, and those of CCD2, CCD3, and CCD4 with a polynomial function, which is then subtracted from the image. The image is then trimmed to remove the overscan region. We have verified that applying additional bias correction frames alongside the overscan correction results in negligible differences, and therefore, no further bias correction is applied.

2.1.2 Read noise and gain

For Spector CCDs (CCD3 and CCD4), we assessed the consistency of the nominal read noise and gain across the fast, medium, and slow readout modes of the new Spector spectrograph by comparing them to manufacturer specifications. All three readout modes produced results consistent with the specifications. We selected the medium readout mode for the Hector survey to achieve an optimal balance between readout time and noise performance. For CCD3 medium read-out mode, the measured read noise and gain are 2.96 e $^-$ and 1.42 e $^-$ /ADU, respectively, compared to the manufacturer specifications of 3.17 e $^-$ and 1.49 e $^-$ /ADU. For CCD4, the measured read noise and gain are 3.84 e $^-$ and 1.18 e $^-$ /ADU, respectively, compared to the manufacturer specifications of 3.93 e $^-$ and 1.12 e $^-$ /ADU. Given the small discrepancies between the measured and manufacturer-specified values, we adopt the manufacturer specifications for calculating the variance arrays.

Observations with the AAOmega CCDs were conducted in normal read-out mode, using the same setup as that employed for the SAMI survey (Croom et al. Reference Croom2021).

2.1.3 Bad pixel mask and cosmic ray rejection

Bad pixel masks are built by identifying pixels with non-linear flux or any residual bias structure. We take the ratio of defocussed flat fields taken on different nights with different exposure times to test for linearity. Pixels consistently deviating more than 3 $\sigma$ from the normalised flat median flux across different exposure times and dates are flagged as bad to avoid occasional cosmic rays or saturated pixels. Three columns (x = 2 046, 2 047 and 2 048) close to the edge of CCD2 show $\sim$ 95% of pixels as bad, so we have masked these entire columns. Due to the camera reading setup, CCD4 previously had an extra line of overscan at x = 2 049, which appeared as an extra bias column in the middle of the detector, shifting all the data by one pixel towards the end of the detector from that column onwards. As of May 2024, the camera reading setup has been corrected, resolving this issue. For data taken before this correction, we do not flag this column as bad, but we apply a correction to relocate it to the end of the detector. The total number of bad pixels for CCDs 1 through 4 are 2 398 (0.028%), 14 339 (0.169%), 4 095 (0.024%), and 8 208 (0.048%), respectively. Even accounting for gradients in our flat-field normalisation via using per-column medians instead of a single median value, the bad pixels exhibit persistent non-linearity and significant deviations ( $\gt\! 3\sigma$ ) across multiple exposure times and dates.

With 30-min exposure times for individual object frames, many are significantly affected by cosmic rays. Cosmic ray detection is performed using the Laplacian edge detection algorithm (L.A.Cosmic; van Dokkum Reference van Dokkum2001). Saturated pixels are also flagged for each frame. All pixels flagged for non-linearity, saturation, or cosmic ray effects are excluded from further processing and assigned NaN values. Remaining bad pixels, including broadened cosmic rays, are further removed just before cube reconstruction, as described in Section 2.5.

2.1.4 Extraction of spectra and removal of scattered light

Central to the data reduction of fibre-fed spectroscopy is extracting individual spectra from the 2D CCD image. This part of the data reduction is done using 2dFdr and closely follows the approach used for the SAMI Galaxy Survey (Croom et al. Reference Croom2021). Here, we briefly outline the key steps and note some minor changes compared to the SAMI pipeline.

First, the locations of the fibre paths across the CCD (commonly called tramline maps) are approximately traced by identifying each fibre in a flat field frame. Next, a higher-precision estimate of the tramline maps is made, by fitting Gaussian profiles to the fibre flat field. The centre of the Gaussian defines the tramline map, while the width is an estimate of the width of the fibre profile, used as part of the extraction process. Scattered light is estimated by averaging the counts in the gaps between slitlets (see Figure 5 of Croom et al. Reference Croom2021), then fitting smooth cubic splines along the gaps. Next, a second cubic spline is fit across the slitlet gaps to build a 2D model of the scattered light across the full detector. The spline uses 8 knots that are approximately equally spaced, depending on the gaps between slitlets. There are more gaps between slitlets for Spector (20) compared to AAOmega (12); see Bryant et al. (in preparation) for details. For the AAOmega blue arm only, we also model extra scattered light around the bright 5 577 Å line. This is not required for Spector, as the scattered light performance of the Spector optics is considerably better than that of AAOmega. The overall level of scattered light can be estimated by taking the ratio of the average flux across the image in the scattered-light model and the average extracted flux in the fibres. For a flat field image this ratio $f_\textrm{sl}/f_\textrm{ex}\simeq0.072$ , 0.068, 0.040, and 0.024 for CCDs 1 through 4, respectively. Once the scattered light is fit and subtracted, the flux in the fibres is calculated by fitting the previously defined Gaussian profiles of the fibres, allowing only the amplitude to change. This fit is done per CCD column, fitting for the amplitude of all fibres at the same time.

2.1.5 Wavelength calibration

The new Spector spectrograph has higher spectral resolution than previous large-scale IFS surveys (AAOmega is used in the same format and resolution as in the SAMI Galaxy Survey). The higher resolution is most significant in the blue part of the spectrum, which is particularly valuable for stellar kinematics and stellar population analysis. The blue arm of Spector has a resolution of $R\simeq3\,400$ at 4 800 Å, (see Table 1) compared to $R\simeq1\,800$ for SAMI (Scott et al. Reference Scott2018), $R\simeq1\,650$ for CALIFA (Sánchez et al. Reference Sánchez2012) and $R\simeq2\,000$ for MaNGA (Law et al. Reference Law2021). The higher resolution enables unique science, such as stellar kinematics of dwarf galaxies or detailed kinematic decomposition of emission lines in wind galaxies. To make the most of this advantage, it is particularly valuable to have a robust and high-quality wavelength calibration.

The usual approach to wavelength calibration is to fit a polynomial to the relationship between pixel coordinates and wavelength for arc frames taken using hollow cathode lamps. This is typically done independently per fibre. The per-fibre approach has some drawbacks. First, some part of the arc lamp spectrum may have only weak lines, making the calibration less secure at those wavelengths. Second, small differences between solutions of adjacent fibres can lead to unphysical differences in calibration between fibres that contribute to the same spaxels in the final data cubes. Third, as line identification is automated in massively multiplexed surveys, occasional failures to identify the correct lines can lead to poor solutions for individual fibres, particularly near the ends of a spectrum.

All the above problems can be addressed by using an approach that finds the wavelength solution across the entire detector at once. Childress et al. (Reference Childress, Vogt, Nielsen and Sharp2014) use this approach for wavelength calibration of the Wide Field Spectrograph on the Siding Spring 2.3 m Telescope. Inspired by this approach, we implement a similar method for reduction of Hector data. The Childress et al. (Reference Childress, Vogt, Nielsen and Sharp2014) approach fits a physically motivated optical model to the arc solutions. The one drawback of this is that it is typically slow to fit (being a non-linear model). Instead, we use a model that is linear, allowing fast and reliable solutions to be derived (typically $\sim 1$ min to generate the solution for an entire arc frame on a standard laptop) but retaining some of the general physical characteristics expected of the system.

The estimated wavelength for a given x,y coordinate on the detector and fibre f is given by

(1)

\begin{equation}\lambda(x,y,f) = \sum_{i=0}^{N_x} \sum_{j=0}^{N_y} a_{ij} T_i(x) T_j(y) + b_f,\end{equation}

where $a_{i,j}$ are the polynomial coefficients that parameterise the effect of the grating and optical distortion across the detector. The $b_f$ parameter is a single constant per fibre that captures features such as small misalignments between fibres as part of the construction of the spectrograph slit. The $T_i(x)$ and $T_j(y)$ are orthogonal Chebyshev polynomials of order i and j that we use as the basis functions of the polynomial part of the model. The polynomial order required depends on the spectrographs, with CCD1 and CCD2 needing $N_x=N_y = 4$ , while CCD3 and CCD4 have a higher-order optical distortion and need $N_x=N_y = 6$ .

This 2D arc fitting approach is implemented in Python as a post-processing of the arc frames after 2dFdr, using ridge regression within the scikit-learn package Pedregosa et al. (Reference Pedregosa2011). It makes use of 2dFdr’s identification of arc lines, together with the tramline maps generated during the fibre extraction process.

In Figure 1, we show example results from the 2D wavelength calibration for CCD3 (other spectrograph arms are qualitatively similar). Figure 1(a) shows a histogram of the residuals from the 2D fit that in this case has a dispersion of 0.079 Å, which is relatively large ( $\simeq0.14$ pixels), reflecting the low S/N ratio of some blue arc lines. The distribution of residuals across the detector is seen in Figure 1(b); while there is scatter and individual lines may have small shifts, globally the residuals are flat and close to zero. Based on Figure 1(b), a small number of lines are removed from the fit because they are consistently offset from the global solution. In Figure 1(c) and (d), we show the residuals projected onto the x and y axes of the detector. The coloured crosses show the mean residual from the 2D model in 10 $\times$ 10 bins across the detector. The mean residuals are consistently below $\simeq0.05$ pix and usually better than this.

Figure 1. The result of 2D wavelength calibration for an example arc frame from CCD3 (frame 19, 28 October 2024). (a) Histogram of residuals from the model fit, defined as (measured arc line wavelength) – (model wavelength). The dotted lines mark $\pm0.1$ pixels on the detector. (b) Residuals across the detector. (c) Residuals as a function of detector x pixel (i.e. the vertical collapse of panel (b)). Small red points are individual line measurements, various coloured connected points are locally averaged residuals in a 10 $\times$ 10 grid across the detector. (d) Small red points are residuals as a function of y detector pixel (i.e. the horizontal collapse of panel (b)). Coloured points are average residuals, as for (c).

One further step is applied in the wavelength calibration to allow for slow drifts in the slit position relative to the camera through the night (see Sharp et al. Reference Sharp2015). The maximum absolute shift is typically $\simeq$ 0.04 Å per hour in AAOmega (largely caused by flexure changes as liquid nitrogen boils off in the camera dewars). For Spector the change is significantly smaller ( $\simeq0.006$ Å per hour), as the cameras are more compact and thermally controlled, and is related to low-level residual thermal changes.

In each object frame the sky lines are used to derive a relative shift compared to the nominal arc frame calibration. A robust linear fit is used, so that the adjustment varies smoothly as a function of CCD x and y pixel, using the same approach as discussed by Croom et al. (Reference Croom2021). However, the fibre-to-fibre adjustment to the wavelength calibration based on twilights that was used for SAMI is not required for Hector.

As an independent check of the wavelength calibration we fit a high-resolution solar template to twilight frames, dividing each fibre into between 10 and 30 smaller chunks as a function of wavelength. We do this using the penalised pixel-fitting (pPXF; Cappellari & Emsellem Reference Cappellari and Emsellem2004; Cappellari Reference Cappellari2017) code in a process that is also used to test the line-spread-function (see Tuntipong et al. in preparation). We then measure the scatter in velocities measured across all the chunks for a frame. Histograms of the residual velocity for each CCD with 1D and 2D arc fitting can be seen in Figure 2. Here, we show the residuals in km s $^{-1}$ rather than Å as this is the direct output of the pPXF fits and is more relevant for science analysis (e.g. kinematic fitting of galaxy spectra). We note that at 4 800 Å, 1 km s $^{-1}$ is equivalent to 0.016 Å (or 0.015 pix for CCD1 and 0.029 pix for CCD3). At 6 800 Å, 1 km s $^{-1}$ is equivalent to 0.023 Å (or 0.040 pix for CCD2 and 0.044 pix for CCD4). The standard deviations of the distributions for the 1D and 2D fitting approaches are shown in the legends in Figure 2. We consistently see improvements moving to the 2D method. To further quantify the level of improvement, we subtract the median statistical uncertainty from pPXF (related to the S/N and number of spectral features in the twilight spectra) from the standard deviations in Figure 2, to provide an estimate of scatter due to only wavelength calibration. For CCD1, the root-mean-square (RMS) scatter in velocity changes from 4.3 to 2.5 km s $^{-1}$ (an improvement of a factor of 1.7). For CCD2, the improvement was more modest, from 1.6 to 1.3 km s $^{-1}$ (a factor of 1.2). For CCD3, the improvement is from 2.2 to 1.2 km s $^{-1}$ (a factor of 1.9). For CCD4, the improvement is large, from 5.7 to 2.0 km s $^{-1}$ (a ratio of 2.9). The large improvement in CCD4 is in part due to the high-order optical distortion in that camera, which is hard to robustly fit on an individual fibre basis. It is worth noting that the scatter in the velocity residuals in CCD2 and CCD4 are likely somewhat overestimated, as telluric absorption impacts the twilight spectrum. Wavelength ranges with obvious telluric absorption have been removed, but some weak effects may remain. Together with the above improvements, the 2D fitting completely removes catastrophic failures.

Figure 2. Comparison of twilight sky velocity residuals from 1D (blue) and 2D (red) arc fitting. We show CCDs 1 through 4 in panels (a), (b), (c), and (d), respectively. The broader distribution for 1D fitting in CCD4 is in part due to the difficulty of fitting the high-order distortion on single fibres (see text for details). The standard deviations shown in the legends are before correcting for statistical velocity measurement uncertainty. Vertical dotted lines indicate the velocity corresponding to 0.1 pixels at 4 800 Å (for CCD1 and CCD3) and 6 800 Å (for CCD2 and CCD4).

Future development of wavelength calibration will aim to look at the long-term stability of the 2D fitted parameters and where possible constrain these to physically motivated constant values. For example, the coefficients related to shifts between fibres in the slit should not change, given the physical construction of the slit. Averaging over many datasets should provide an even more accurate estimate for fibre-to-fibre positions. We will also monitor for temporal drifts in the coefficients and check that the solution remains appropriate for the data by comparing residuals.

2.1.6 Flat fielding

Using the wavelength solutions, we extract and process dome flat frames for each tile. Additionally, twilight flat frames, obtained whenever possible during observations to enhance various calibration aspects, are processed. In the SAMI Galaxy Survey, the dome flat frames had extremely faint signals at the blue end of the spectrum in AAOmega blue, so twilight flat frames were used for flat fielding in the blue arm. Since the SAMI observations concluded in 2018, several updates have been made to the AAT dome flat lighting system, including the installation of more blue lights. These improvements have led us to re-examine the spectral uniformity of the dome flat frames for calibration.

Figure 3 presents reduced and normalised spectra from the dome flat and twilight flat frames. The spectra were extracted from the fibre in the middle of each CCD, and normalised for detector gain, photon energy, collecting area, and spectral resolution. We observe several peaks at blue wavelengths ( $\lesssim 4\,500$ Å) in the dome flat spectra of the blue arm, caused by the installation of an array of different LEDs to boost blue light levels. The strong signals at the blue end greatly help reduce uncertainties in light extraction, but the presence of narrow and strong peaks combined with non-uniform illumination across the focal plane leads to residuals in the reduced dome flat frames. These persist even after normalising the flat spectra from each fibre by the median spectrum across all fibres. We find that the reduced dome flat frames exhibit residual gradients, resulting in up to 3% discrepancies compared to the reduced twilight flat frames in the blue arm, which can also introduce artificial colour gradients across the spectrum. In contrast, the red arm dome flat spectra exhibit much smoother spectral uniformity, without the residuals or colour gradient observed in the blue arm.

Figure 3. Flux density derived from dome flat (top) and twilight flat (bottom) frames, extracted from the fibre in the middle of each CCD. The flux density is normalised for gain, photon energy, collecting area, and spectral resolution. The dark and light blue spectra originate from the blue arms of AAOmega and Spector, respectively, while the red and orange spectra are from their red arms. The flat spectra, converted to a flux density scale, illustrate the shape of the flat-field frames and only indirectly reflect relative throughput. For absolute throughput measurements for Hector, refer to Section 2.3.2.

We therefore adopt the SAMI convention for flat fielding by using twilight flat frames for the blue arms, after carrying out a spline fit to the twilight generated flat field to remove the residual impact of solar absorption features in the twilight spectrum. The majority of object frames (68%) were flat-fielded using twilight flats from the same tile configuration, while the remaining 32% used twilight flats from different tiles within the same observing run. For the red arm, we use dome flat frames for flat fielding.

While twilight flats remain the default for blue-arm calibration, we will explore the feasibility of using dome flats alone in future releases. Recent upgrades to the dome flat lighting system have improved the blue-light signal, but further work is needed to address spectral non-uniformities. Ongoing development will focus on characterising and correcting these effects, for example, through empirical illumination corrections.

2.1.7 Correcting fibre-to-fibre variations in throughput

Accurately correcting fibre-to-fibre throughput variations is essential for ensuring reliable flux calibration and uniformity in spectral data. This is achieved using throughput maps derived from twilight flat observations, taken with the same fibre configuration on the same day. Twilight flats are ideal for this purpose, as they provide uniform and consistent illumination across all fibres, enabling precise calibration of their relative sensitivities. When twilight flats cannot be acquired due to challenges like poor weather, the pipeline adopts an alternative approach by generating throughput maps from dome flat frames specific to the same tile. The relative throughput values estimated from dome flats show reasonably good agreement with those derived from twilight flats. For example, in frames taken in November 2024, 90%, 92%, 91%, and 88% of fibres on CCDs 1 through 4, respectively, exhibit discrepancies of less than 1% between the two estimates. Additionally, if the residuals after sky subtraction (as described in Section 2.1.8) are large in frames where dome flats were used for throughput correction, the pipeline switches to a sky-line-based throughput correction, using the fluxes of night-sky lines averaged across multiple frames taken for a single tile.

The relative throughput for each twilight flat (or occasionally a dome flat) is calculated by determining the mean flux for each fibre while excluding bad pixels and then normalising these mean values across fibres using their median. If multiple twilight flat frames are available for a tile, the final relative throughput is obtained by averaging the throughput values from all available frames. To correct throughput variations, each fibre’s spectral data is divided by its respective throughput value, ensuring consistent normalisation across fibres. Fibres with invalid throughput values (e.g. NaN or values below zero) are flagged, and their spectra are excluded from further analysis.

2.1.8 Sky subtraction

Sky subtraction is performed following the same approach used by the SAMI Galaxy Survey (Sharp et al. Reference Sharp2015; Croom et al. Reference Croom2021). A median sky spectrum is calculated from the sky spectra per frame and subtracted from the data. The main difference between SAMI and Hector is that the sky fibres are located around the edge of the field-of-view. Hector also has a larger number of sky fibres than SAMI (at least in part to compensate for not all sky fibres being active at any one time). To assess the level of sky-subtraction accuracy, we calculate the median residual sky in sky-subtracted sky spectra. The median absolute per-fibre fractional continuum sky residual (i.e. flux in sky-subtracted spectrum divided by flux without sky subtraction) across all data from 2023 and 2024 is 0.014, 0.016, 0.012, and 0.010 for CCDs 1 through 4, respectively. The better performance for CCD3 and CCD4 reflects the overall lower level of scattered light in the Spector spectrographs (see Section 2.1.4). We also calculate the sky residuals left in night sky emission lines. The median absolute per-fibre fractional sky-line residuals are 0.011, 0.012, 0.015, and 0.010 for CCDs 1 through 4, respectively. We only calculate the sky-line fractional residuals at the location of strong night-sky emission lines.

2.2 Chromatic variation in distortion correction

The Hector instrument uses the 2dF facility’s corrector lens system, which provides a 2-degree diameter field of view at the AAT prime focus. An atmospheric dispersion corrector (ADC) is built into the front two elements of the corrector to compensate for atmospheric dispersion at zenith distances up to 67 degrees. These two elements are counter-rotating prismatic doublets that introduce equal and opposite dispersion, effectively counteracting atmospheric effects as the telescope tracks across the sky (Lewis et al. Reference Lewis2002) and providing real-time correction during observations.

In the absence of an ADC, as in the case of Hector’s predecessor, SAMI, corrections for differential atmospheric refraction were performed during data reduction, relying on knowledge of the altitude, parallactic angle, and atmospheric conditions (temperature, pressure, and humidity; Croom et al. Reference Croom2012; Bryant et al. Reference Bryant2015). A similar post-processing strategy has also been implemented in the MaNGA survey (Law et al. Reference Law2015, Reference Law2016). The CALIFA survey (Sánchez et al. Reference Sánchez2012), on the other hand, adopts a more direct approach, where the differential atmospheric refraction is first estimated from reconstructed data cubes, after which the cubes are regenerated incorporating the measured effect (Sánchez Reference Sánchez2006; García-Benito et al. Reference García-Benito2015).

Although the ADC corrects for atmospheric refraction, residual optical effects from the corrector system introduce wavelength-dependent shifts in image position, i.e. chromatic variations in distortion (CVD). If not accounted for, these shifts can result in an underestimation of the extracted flux in the reduced data frames.

In this section, we describe the construction of the Hector optical model, which we integrate into the data reduction pipeline to correct for the CVD effects.

2.2.1 Distortion dependence on field radius and wavelength

As part of the Hector instrument commissioning phase, we conducted stellar observations to map chromatic variation in distortion across the 2-degree field of view of the Hector plate. For each stellar observation, we quantified the stellar centroids shifts by fitting a Moffat profile in 100 Å wavelength intervals. Figure 4 presents these distortions as a function of both radial distance from the centre of the Hector plate and wavelength, using the coordinate system of the Hector robotic positioning system.

Figure 4. Stellar observations illustrating the effects of chromatic variations in distortion (CVD) are shown as a function of wavelength and position across the Hector plate, presented in the coordinate system used by the Hector robot. Black-filled circles mark the stellar centroid at a reference wavelength of 6 000 Å, while coloured points trace the shift in the centroids of stellar observations across wavelength, shifting from redder to bluer wavelengths (red-to-blue filled-in circles) relative to the centroid at the reference wavelength. For clarity, the centroid shifts due to CVD effects are exaggerated by a factor of 20; the maximum shift is $\sim$ 120 $\unicode{x03BC}$ m (1.17 times the fibre core diameter). For several hexabundles, we also illustrate the hexabundle orientation and cable direction (see Section 3.4 for discussion on the orientation of hexabundles and associated corrections). Grey lines connect the physical centres of each hexabundle to the centre of the Hector plate.

Each stellar observation is colour-coded from blue to red, indicating the centroid shift with increasing wavelength relative to the centroid at a reference wavelength of 6 000 Å. For clarity, the magnitude of distortion is exaggerated, with the maximum variation reaching 120 $\unicode{x03BC}$ m. The solid grey lines connect the stellar centroid at the reference wavelength to the centre of the Hector plate.

We model the distortion across the Hector plate using a polynomial function with terms $\alpha^7$ , $\alpha^5$ , $\alpha^3$ , and $\alpha^1$ , where $\alpha$ represents the field radius. The wavelength dependence of each coefficient is then parameterised using a quadratic function.

Figure 5(a) compares the modelled distortion with the values observed along a radial direction of the plate. As in Figure 4, each vertical colour-coded set of points represents the centroid offsets relative to the reference position for each stellar observation. The modelled quadratic distortion pattern is shown for the bluest (3 730 Å) and reddest (7 330 Å) wavelengths of the Hector data as solid blue and red lines, respectively.

Figure 5(b) presents the residuals between the model and the data at three different wavelengths, demonstrating that the residuals are within $\pm5\,\unicode{x03BC}$ m at $\lambda\gt 4\,600$ Å, approximately one-fifth the size of 1 fibre core (Bryant et al. Reference Bryant, Bryant, Motohara and Vernet2024). The relatively larger scatter observed at the blue points, corresponding to residuals at 3 800 Å, is largely driven by the reduced signal-to-noise at the blue end of the blue CCDs, increasing the scatter in the measured centroid positions.

The final two panels, Figure 5(c) and (d), show the RMS of the residuals. The panel (c) presents the RMS as a function of plate radius, with points colour coded from blue to red to indicate increasing wavelength, and (d) shows the RMS as a function of wavelength.

2.3 Flux calibration

2.3.1 Primary flux calibration

We derive the transfer function for flux calibration, $\mathcal{T}(\lambda)$ , using primary standard stars, selected from A- or F-type stars listed in the high-resolution, telluric-corrected spectra provided by the Supernovae Factory projectFootnote ^b Aldering et al. (Reference Aldering, Tyson and Wolff2002). These stars were typically observed three times per night for each instrument, whenever conditions allowed.

Since we use telluric-corrected spectra as our reference, the first step involves applying telluric corrections to the reduced primary standard star frames. See Section 2.4 for details on the telluric correction process for Hector. After applying CVD corrections, we extract the standard star spectrum from each calibration frame using 2D Moffat fitting to better account for the PSF wings. Atmospheric extinction is corrected by scaling the standard extinction curve for Siding Spring Observatory to the effective airmass of each observation and adjusting both the flux and variance. Figure 6(a) and (b) demonstrate that the Moffat fitting method efficiently extracts stellar spectra with fluxes that are approximately 1–1.2 times higher than those obtained by summing the flux over the entire bundle, minimising noise contamination and recovering flux lost in the gaps between fibres. Even though we use high-resolution standard spectra (often with 1–4.2 Å bins) as a reference, the observed spectra exhibit even finer spectral resolution and binning. Consequently, we re-bin the observed spectra to match the coarser scale of the reference spectrum, a step also employed in SAMI DR3. Additionally, we introduce a new step: convolving the observed spectra to match the resolution of the reference spectra. This convolution helps prevent overestimation of the transfer function, particularly in the presence of strong absorption lines. For example, without convolution, the transfer functions are typically overestimated by 0.5–2% below 4 000 Å, with larger discrepancies in the presence of strong absorption lines.

Figure 5. Modelling the Chromatic Variation in Distortion across the Hector plate. (a) Distortion as a function of position along the Plate y-coordinate across the Hector plate, from left-to-right, as shown in Figure 4. Also, as in Figure 4, the colour gradient from blue to red represents measured centroid offsets as a function of wavelength. The modelled distortion at wavelengths of 3 730 and 7 330 Å is shown as solid blue and red lines, respectively. (b) Residuals between the model and observed distortions at 3 800, 5 000, and 7 200 Å, demonstrating that the model effectively reproduces the measured distortions across the Hector plate to approximately within $\pm 10 \unicode{x03BC}$ m. (c) RMS of the residuals as a function of radius on the Hector plate, with colours indicating increasing wavelength from blue to red. (d) RMS of the residuals as a function of wavelength, illustrating that RMS progressively becomes larger towards bluer wavelengths.

Figure 6. Example of deriving the transfer function $\mathcal{T}(\lambda)$ from a primary standard star, LTT 3218, observed on 8 December 2023 using Hexabundle O from Spector blue. (a) Extracted standard star spectrum using Moffat fitting and integrated spectrum over the bundle. (b) Comparison between the extracted and summed spectra. (c) Ratio between the reference and unconvolved observed (extracted) spectra (grey). The transfer function (red dashed line), derived after convolving the observed spectrum to match the reference resolution, does not show local peaks at the positions of absorption lines. (d) Observed, reference, and flux-calibrated spectra. The flux-calibrated spectrum matches the reference well, while retaining sharper absorption features due to its higher spectral resolution.

We then apply a cubic spline fit to the flux ratios between the reference and modified observed spectra to derive the transfer function for each standard star frame (Figure 6(c)). To better address the characteristics of the Spector CCDs, which exhibit a sharp turn at the edge of the transfer function due to the dichroic, we increase the number of knots compared to the SAMI approach. For the AAOmega red CCD, which has a smooth and relatively consistent flux ratio, we use 6 knots for the fitting. For the AAOmega blue CCD, we use 8 evenly distributed knots and introduce an additional knot at each end of the wavelength range to more accurately capture any edge variations. The Spector blue CCD, with its sharper changes at the red end, requires additional knots in that region, while the Spector red CCD needs extra knots at the blue end. Specifically, we add 8 extra knots above 5 600 Å for Spector blue and below 6 000 Å for Spector red to ensure the fitting process accurately captures these edge effects. Figure 6(d) presents an example for the Spector blue CCD, comparing the reference, observed, and calibrated spectra. The transfer functions are median combined for each CCD within each observing run and applied to the reduced object frames for primary flux calibrations.

2.3.2 Sky to detector throughput estimation

After deriving the standard star transfer function, $\mathcal{T}(\lambda)$ , for each primary standard star frame, as described in Section 2.3.1, we convert it into a fractional throughput, $\eta(\lambda)$ , via

(2)

\begin{equation}\eta(\lambda)\;=\;\frac{1}{\mathcal{T}(\lambda)}\;\times\;\frac{h\,c}{\lambda\,A\,\Delta\lambda}\;\times\;\texttt{gain}\end{equation}

where h is Planck’s constant, c is the speed of light, A is the telescope’s collecting area, and $\Delta\lambda$ is the wavelength bin size.

We then combine $\eta(\lambda)$ from each standard-star exposure to form a ‘best’ and ‘mean’ throughput reference for each spectrograph arm. To determine the best throughput, we first exclude outlier throughput curves that are greater than $\pm20\%$ of the median of all curves. The best curve is the remaining curve with the highest throughput at specific wavelengths for each spectrograph arm. For the blue arms, we select the throughput with the highest value at 4 500 Å and for the red arms, we select the highest value at 6 500 Å. The best throughputs from our observations are shown in Figure 7. The best throughput represents the sky to detector throughput in the best conditions. Lower throughputs are in poorer conditions, typically due to weather. Notably, Spector demonstrates higher throughput across its entire wavelength range, exceeding AAOmega by more than 35% on the blue arm and 20% in the red arm near their respective peaks.

Figure 7. Sky to detector throughput achievable by Hector in the best conditions. Spector shows significantly higher throughput in both the blue and red arms relative to AAOmega.

To compute the mean throughput, we average $\eta(\lambda)$ across all standard-star frames and exclude any that deviate by more than 20% from this mean, thus removing cloud-affected or otherwise problematic exposures. As a further quality check during observations, we define transmission to be the ratio of the current throughput to the mean throughput. A low transmission indicates poor transparency (e.g. due to adverse weather), and such frames are flagged for possible re-observation. The overall shape of the mean throughput is similar to that of the best throughput curve, but, as expected, it exhibits a lower amplitude due to the averaging over multiple exposures.

2.3.3 Secondary flux calibration

The primary flux calibration is only approximate, as it assumes no change in the atmospheric conditions between the observations of the spectrophotometric standard stars and the galaxies. In practice, however, variations such as changes in airmass, telluric absorption, and sky conditions occur. Therefore, a more precise normalisation is achieved by observing two secondary standard stars–one feeding into AAOmega and the other into Spector–alongside the galaxies, using two dedicated hexabundles that are allocated for this purpose during each observation.

The Hector secondary standard catalogue is constructed from stars colour-selected to be of spectral type F, with an additional magnitude cut applied to ensure high signal-to-noise observations. Owing to their relative abundance in the Milky Way, and their comparatively smooth spectral energy distributions, F-type stars are commonly adopted as flux calibrators (Yan et al. Reference Yan2016). These stars are then compared to their photometric magnitudes to determine a modified transfer function. This secondary flux calibration process for Hector data follows the same approach used in the SAMI Galaxy Survey and described in Croom et al. (Reference Croom2021).

Prior to secondary flux calibration, we correct each reduced row stacked spectra (RSS) frame for atmospheric extinction, approximately flux calibrate using the primary standard, and correct for telluric absorption. The flux of the secondary standard star is then extracted from each RSS frame by fitting a Moffat profile, incorporating corrections for the CVD effects. These extracted spectra are fitted using the pPXF code, using Kurucz (Reference Kurucz1992) model atmospheres. Model atmospheres are used in preference to empirical reference spectra, as reliable observed spectra are not available for these secondary standards.

The pPXF fitting process consists of two steps: first, individual templates spanning a grid of effective temperature ( $T_{\text{eff}}$ ), metallicity ([Fe/H]), and surface gravity are fitted. Then, for the best-fitting surface gravity, the four nearest templates in $T_{\text{eff}}$ and [Fe/H] are refitted, allowing a linear combination of templates.

The fitting is performed only on the blue arm data, as it contains the prominent absorption features required to constrain the models, and includes an eighth-order multiplicative polynomial to correct residual transfer function errors. Moreover, the template weights are averaged across all observations within a field, typically encompassing seven observations of the same star, to derive a best-fitting template. This template is then normalised using the observed g- and r-band photometry of the star, applying the average normalisation from the two bands.

Although transfer functions can be derived for individual RSS frames by comparing the observed spectrum to the best-fitting template, this approach can introduce scatter. To mitigate this, we apply an averaged transfer function, computed from all observations of a standard star in a given field. Individual frame normalisation is still allowed to account for variations in transmission.

Figure 8. (a) Hector-to-SDSS flux ratio using 3-arcsec diameter aperture spectra as function of Hector PSF FWHM. (b) Distribution of Hector-to-SDSS flux ratio. Black squares denote the median and normalised median absolute deviation computed across four bins of the PSF FWHM. Blue and red horizontal lines denote the median value of AAOmega/Spector blue and red arms, respectively.

2.3.4 Flux calibration stability

Following a similar approach to Croom et al. (Reference Croom2021), here we compare 3-arcsec circular aperture spectra extracted from Hector datacubes to single-fibre SDSS spectra using a subsample of 151 galaxies. Apertures are placed in the datacubes at the sky location determined by the SDSS fibre coordinates PLUG_RA, and PLUG_DEC. Since SDSS fibre spectra are matched to PSF magnitudes, we account for this effect by scaling the fluxes by a factor $\simeq 0.72$ (i.e. 0.35 magFootnote ^c ).

Figure 8(a) shows the median Hector-to-SDSS flux ratio across both arms as function of the Hector PSF FWHM. Panel (b) shows the distribution of the flux ratios in both arms. This is equivalent to Figure 11 in Croom et al. (Reference Croom2021), where they reported a median offset and dispersion with respect to SDSS of 1.04 and 0.16, respectively. Here we report median ratio values of 0.86 and 0.87 (blue and red horizontal lines of Figure 8(b)), and dispersion 0.15 and 0.14 for the blue and red arms, respectively.

To further investigate the quality of our calibration as function of wavelength, we show in Figure 9 the 16, 50 and 84 percentiles of the ratio between Hector and SDSS spectra in bins of 100 Å for both arms. To isolate systematic trends with wavelength, each spectrum ratio has been normalised by the corresponding median value for its arm, as reported in the previous paragraph. These results can be directly compared with Figure 13 in Croom et al. (Reference Croom2021).

The calibration of the red arm shows minimal dependence with wavelength. The blue arm, on the other hand, presents a systematic decreasing offset inversely proportional to the wavelength, which is similar to the effect seen in the SAMI-SDSS comparison (Croom et al. Reference Croom2021). In contrast, Husemann et al. (Reference Husemann2013) reported a decreasing SDSS-to-CALIFA flux ratio toward the blue, which represents the opposite trend. These differences highlight that blue-end discrepancies can depend on the choice of reference dataset and calibration method, and should be taken into account when interpreting or comparing spectral shapes across surveys. Such wavelength-dependent offset can result from a combination of multiple effects in either the SDSS or Hector data sets, including a poorer signal on the blue end of the detector due to lower throughput, challenging the estimation of the transfer function, as well as problems related to atmospheric extinction. Nevertheless, the comparison of aperture fibre-like spectra is challenging as it is heavily affected by differences in the seeing conditions of both surveys as well as potential astrometric mismatches between both datasets.

Figure 9. (a) Flux ratio of Hector 3-arcsec diameter aperture spectra to SDSS fibre spectra. The median flux ratio is estimated in bins of 100 Å. Blue and red lines illustrate the 16th, 50th (filled circles) and 84th percentiles of the flux ratio as function of wavelength for both blue and red AAOmega/Spector arms, respectively. (b) Same as (a) but re-scaling each spectra by the median offset between SDSS and Hector.

We perform an additional test by comparing synthetic DECam g-and r-band (Flaugher et al. Reference Flaugher2015) photometry with Legacy Survey (LS) DR10 imaging data (Dey et al. Reference Dey2019). Synthetic photometry is derived by convolving Hector spectra with the DECam g and r filters using the Population Synthesis Toolkit (PST Footnote ^d ; Corcho-Caballero et al. Reference Corcho-Caballero, Ascasibar and Jiménez-López2025). Then we measure the curve of growth using circular apertures with diameters ranging from 2 to 20 arcsec for both LS and Hector datasets.

Figure 10. (a) Ratio of the Hector to Legacy Survey (LS) g-band aperture flux as function of aperture diameter. The black solid line and red (blue) region denote the median and 68% (90%) confidence interval, respectively, as function of aperture diameter. (b) Hector-to-LS flux ratio distribution for a 10-arcsec diameter circular aperture. The solid and dotted lines illustrate the median and dispersion (based on the 16th and 8th percentiles) of the distribution reported on the top-right corner of the panel. (c) and (d) Same as (a) and (b), respectively, using the r band and restricted to Spector cubes. (e) Aperture-based $g/r$ colour ratio between Hector and LS as function of aperture diameter. (f) Distribution of colour ratios for a 10-arcsec diameter circular aperture.

The results are summarised in Figure 10, where panels (a) and (b) illustrate the ratio distribution between Hector and LS circular apertures as a function of aperture diameter, for the g and r bands, respectively. In panel (b) the sample is restricted to Spector data, whose spectral range fully covers the r bandpass. The best agreement between both datasets is found for an aperture of $\simeq10$ arcsec in both bands.

Panels (b) and (d) show the flux ratio distribution computed using a 10-arcsec aperture in both bands. The median flux ratio in the g and r bands is 0.97 ( $\pm 0.10/0.14$ ), and 0.94 ( $\pm 0.07/0.12$ ), respectively, consistent with an absolute spectro-photometric accuracy of $\lesssim15\%$ . Panel (e) shows the $g/r$ flux ratio between Hector (Spector data only) and LS as function of aperture diameter as an additional proxy for colour stability. The median displays an almost perfectly flat trend at all apertures. Using the same aperture diameter as in (b) and (d), we show in panel (f) the $g/r$ flux ratio distribution between Hector and LS. We find a small fraction of outliers presenting elevated colour offsets of up to 50%. These objects appear more extended in the synthetic g-band maps than in the LS imaging data, potentially due to seeing differences, requiring a more careful photometric analysis (e.g. using seeing-dependent apertures). In addition, Figure 11 shows the g band offset as function of effective airmass, where no correlation between both quantities is detected. Overall, we find that $g/r$ flux ratio between Hector and LS imaging data presents a global median and dispersion values of $1.03\pm 0.09/0.11$ , indicating a relative spectro-photometric calibration of $\simeq10\%$ ( $\sim 0.1$ mag).

Figure 11. (a) Distribution of cube effective airmass. (b) g-band Hector-to-LS flux ratio distribution using a 10-arcsec diameter circular aperture as function of effective airmass. Red symbols denote the median and NMAD computed on bins equal to the x-axis error bars.

At the $\sim$ 10% relative $g/r$ uncertainty level implied by the Hector–LS comparison, the corresponding colour scatter is of order $\Delta(g{-}r)\simeq 0.1$ mag (with a median offset of $\simeq 0.03$ mag), which translates to an upper-bound $\Delta E(B{-}V)\simeq 0.1$ mag under standard extinction/attenuation curves (e.g. Calzetti et al. Reference Calzetti2000). This should be regarded as an upper limit, and the propagated effects on broad-wavelength inferences (dust attenuation and SPS) are expected to be modest and do not affect the qualitative trends in our early-science results (e.g. Conroy Reference Conroy2013; Walcher et al. Reference Walcher, Groves, Budavári and Dale2011). We also expect strong-line metallicities and stellar kinematics to remain effectively unchanged (Kewley & Ellison Reference Kewley and Ellison2008; Cappellari & Emsellem Reference Cappellari and Emsellem2004).

2.3.5 Overlap between blue- and red-arm spectra

We test how well the spector blue- and red-arm spectra aligned with each other, which may serve as an independent check on the accuracy of the flux calibration. Since the blue- and red-arm spectra from Spector overlap within a short wavelength range, we measured the flux differences at eight wavelength points ( $\lambda$ -points) within this region. Figure 12 shows the outcome of this test for an example galaxy. The means and noises of the spectra, estimated within $\pm5$ Å at each $\lambda$ -point, are presented in Figure 12(c). Here, noise is defined as the standard deviation of the flux values, which was estimated after removing local linear trends to account for spectral slope variations. Figure 12(d) shows the percentage flux difference between the blue- and red-arm spectra relative to the red-arm flux at 5 800 Å. The flux difference mostly appears to be less than 2 per cent, which falls within a reasonable scope considering the spectrum noise and the intrinsically low throughput in the overlap region (see Figure 7).

Figure 12. Examining the overlap between blue- and red-arm spectra for an example galaxy. (a) Overall shapes of the blue- and red-arm spectra, both normalised to the red-arm flux at 5 800 Å. (b) Zoomed-in view of the overlapping region. (c) Mean (solid lines) and standard deviation (dashed lines) of the flux values at eight wavelength points in this galaxy, each within a $\pm 5$ Å interval. The standard deviation was estimated after removing local linear trends. (d) The percentage difference between the blue and red fluxes relative to the red-arm flux at 5 800 Å (solid line) with its propagated uncertainty (dashed lines).

Figure 13 presents the statistics of the blue-red flux difference in percentage. For this, we produced a pair of blue- and red-arm spectra by integrating the data cubes within a central 3-arcsec radius for each galaxy observed using Spector. For the spector galaxy spectra with S/N $\geq$ 10 in Figure 13(b), their absolute mean values of the blue-red flux difference are close to zero ( $\lesssim 0.8\%$ ). The half value of the range between 16 and 84 percentiles, which approximately corresponds to the 1 $\sigma$ range if a normal distribution is supposed, is as large as $\approx 5\%$ at $\lambda$ -points 3–6. Figure 13(c) shows the blue-red flux difference divided by the propagated noise. The 16th-to-84th percentile half-range is mostly below one, indicating statistically reasonable agreement. The data distributions for each $\lambda$ -point in Figure 13(c) follow a normal distribution reasonably well, except that the distribution at $\lambda$ -point 1 exhibits significantly larger variance. This indicates a higher level of uncertainty at the blue end of the red-arm spectrum, while the data at the remaining $\lambda$ -points appear to be stable. It is worth noting that, at the edges of the overlap range, the throughput of the blue or red arm of Spector is extremely low ( $\lesssim 0.03$ ).

2.4 Telluric correction

We perform telluric corrections in a similar manner to SAMI DR3, using the molecfit (Smette et al. Reference Smette2015; Kausch et al. Reference Kausch2015) telluric fitting software with the equ.atm reference profile to fit for atmospheric absorption by H $_2$ O and O $_2$ molecules. The correction is fit to the extracted spectrum of the secondary standard star in each spectrograph over the full wavelength range in the red arm, and applied to each spectrum in the row-stacked spectra frames. Figure 14 panels (a) and (b) illustrate the correction for the AAOmega spectrograph for galaxy C901005481610591 and secondary standard star S481602915 observed concurrently. Panels (c) and (d) illustrate the corresponding correction for the Spector spectrograph, for galaxy C901005167806973 and secondary standard star S481609373. The galaxy and star spectra are extracted from a 3-arcsec radius aperture centred on the brightest spaxel in the cube.

Figure 13. Statistics of the blue-red flux difference, using the blue- and red-arm spectra integrated within a central 3-arcsec radius in the data cube for each galaxy observed using Spector. (a) Blue-red flux difference in percentage for all Spector galaxy spectra without a S/N cut. The number of blue-red spectra sets is given in parentheses. Note that the number of spectra exceeds the number of galaxies because some galaxies were observed multiple times. (b) Spector galaxy spectra with S/N $\geq$ 10. The values at the bottom show the median $\pm$ half the range between 16 and 84 percentiles at each wavelength point ( $\lambda$ -point; as defined in Figure 12). (c) The same as (b), but the blue-red flux difference is divided by the propagated noise at each $\lambda$ -point, not by the red-arm flux at 5 800 Å.

Figure 14. Telluric correction for CCD2 and CCD4. (a) 3-arcsec aperture spectrum for galaxy C901005481610591 and secondary standard star S481602915 after correction, observed with CCD2. (b) Telluric correction applied to both star and galaxy spectra in CCD2. (c) 3-arcsec aperture spectrum for galaxy C901005167806973 and secondary standard star S481609373 after correction, observed with CCD4. (d) Telluric correction applied to both star and galaxy spectra in CCD4.

While we do not perform a separate quantitative evaluation of the telluric correction accuracy in this work, we follow the same procedure validated in the SAMI DR3 pipeline and find the correction to be robust for the typical wavelength regions of interest. A more detailed evaluation will be explored in future releases.

2.5 Cubing

Each field is observed by offsetting the telescope between each of seven 1 800 s frames in a dither pattern. The offsets are 0.4–0.7 arcsec with a central position and 6 radial positions. This pattern was optimised originally for SAMI in Sharp et al. (Reference Sharp2015), and is driven primarily by the site seeing and fibre size that remain the same for Hector.

The seven dithered RSS frames are centred and aligned prior to cubing. In each frame, object centres are determined by fitting two-dimensional Gaussians across the field, with a mask applied to minimise contamination from nearby stars or secondary objects within the same bundle. Mean offsets relative to the reference (first) frame are then computed from the measured positions and applied to align the frames. Remaining bad pixels, including broadened cosmic rays affected by charge diffusion (particularly in the thick red CCD2 detector), are removed at this stage using sigma clipping, based on comparisons of fibre spectra across multiple frames. This procedure follows the approach described in the SAMI DR3 paper (Croom et al. Reference Croom2021). The aligned frames are then combined into a three-dimensional data cube, preserving both spatial and spectral information through a drizzle-like algorithm originally introduced by Fruchter & Hook (Reference Fruchter and Hook2002) for imaging data (e.g. Koekemoer et al. Reference Koekemoer2011), and later adapted by Sharp et al. (Reference Sharp2015) for SAMI IFS cubing. Separate blue and red cubes are generated for each target, corresponding to data from the blue (CCD1 and CCD3) and red (CCD2 and CCD4) spectrograph arms.

This drizzle-like approach is conceptually similar to the flux redistribution scheme first introduced by Sánchez et al. (Reference Sánchez2012) for CALIFA, which uses a truncated Gaussian kernel and has since been adopted by other IFS surveys such as MaNGA (Law et al. Reference Law2016) and CAVITY (García-Benito et al. Reference García-Benito2024). Both approaches aim to reconstruct regularly gridded data from irregular fibre positions, but differ in their choice of kernel and resampling strategy. The drizzle-like method assigns uniform weight within the drop footprint, preserving fine spatial structure, whereas the Gaussian kernel applies distance-dependent weights that decrease from the fibre centre, resulting in smoother and more stable sampling. While both methods are effective, these differences can lead to variations in spatial resolution and noise properties in the final datacubes.

Applying a drizzle-like algorithm requires specifying the drop size, defined as the effective footprint of a fibre projected onto the output spaxel grid during resampling. The SAMI Survey employed a drop size of 0.8 arcsec, corresponding to 50% of the 1.6-arcsec fibre size, to recover the intrinsic spatial resolution. Hector shares the same fibre size as SAMI but differs significantly in several key aspects, including throughput, spectral resolution, and wavelength coverage. These differences, along with the importance of optimising spatial sampling, call for a comprehensive evaluation of drop sizes to achieve an optimal balance between S/N and spatial resolution recovery. To address this, we tested various drop sizes specifically for Hector data by generating test cubes with drop sizes of 0.8 arcsec (50%; SAMI-like), 1.2 arcsec (75%), and 1.6 arcsec (100%; full fibre size) for a sample of 134 secondary standard stars observed in 2023.

Figure 15 presents a detailed comparison of the impact of these drop sizes on spatial resolution, PSF recovery, and overall data quality. The PSF FWHM of the stars is measured from the g-band images generated using the test cubes. The S/N is calculated as the median of the S/N values for each spaxel within the central 3 arcsec $^2$ region ( $\sim$ 36 spaxels). Smaller drop sizes provide better spatial resolution, as evidenced by the smaller FWHM values in the top and middle panels. In the middle panel, we show the ratio of FWHM values for the 75% and 100% drop sizes relative to the 50% drop size as a function of the input FWHM, to assess the impact of selecting larger drop sizes compared to the SAMI standard. The median $FWHM_{75}$ / $FWHM_{50}$ and $FWHM_{100}$ / $FWHM_{50}$ are 1.043 and 1.099, respectively, at an input FWHM of 2 arcsec, which corresponds to the typical PSF observed with Hector. The FWHM ratios appear to decrease slightly with increasing input FWHM, indicating that the differences between drop sizes become less significant for larger input FWHM values. For instance, when the input FWHM exceeds 2.5 arcsec, there is no strong evidence to suggest that using a 50% drop size yields noticeably better FWHM compared to a 75% drop size.

Figure 15. Comparison of the effects of drop sizes (50%, 75%, and 100%) on spatial resolution and S/N in Hector data reduction. (a) Output FWHM measured from the cubes as a function of input FWHM, measured as the median FWHM of RSS frames before cubing, for drop sizes of 50% (black circles), 75% (orange triangles), and 100% (blue diamonds). The diagonal line represents a one-to-one relationship. Smaller drop sizes result in slightly better spatial resolution (smaller FWHM). (b) FWHM ratios relative to the 50% drop size as a function of input FWHM. Larger drop sizes consistently produce higher FWHM values, with the difference becoming less pronounced for larger input FWHM. (c) S/N ratios relative to the 50% drop size as a function of S/N for the 50% drop size. Larger drop sizes result in significantly improved S/N, highlighting the trade-off between spatial resolution and S/N in the data reduction process.

While smaller drop sizes improve spatial resolution, they reduce the S/N per spaxel, even though the total S/N across the object remains conserved. This apparent reduction in per-spaxel S/N arises from the redistribution of signal across more spaxels, which increases the correlation between neighbouring spaxels without introducing additional noise or reducing the total flux. Larger drop sizes (75% and 100%) improve S/N per spaxel, as shown in the bottom panel, where the median $SN_{75}$ / $SN_{50}$ and $SN_{100}$ / $SN_{50}$ are 1.311 and 1.653, respectively. This is because larger drop sizes collect more flux per spaxel, at the expense of spatial resolution. Our results highlight a fundamental trade-off between spatial resolution and S/N per spaxel. Smaller drop sizes are advantageous for applications that require high spatial resolution, whereas larger drop sizes maximise per-spaxel S/N, which is beneficial for a broader range of analyses that are performed on a per-spaxel basis. Larger drop sizes also tend to produce more uniform weight maps and reduce the risk of gaps caused by imperfect dithering. Assessing this trade-off, we chose to adopt a drop size of 1.2 arcsec (75%) for the drizzle-like cubing algorithm for Hector reduction, as it achieves a 30% gain in per-spaxel S/N while incurring only a 4% increase in FWHM under typical Hector observing conditions.

As the next step, the flux (C), variance (V), and weight (W) cubes were scaled using a scale factor, $\zeta$ of 0.75, corresponding to a drop size of 1.2 arcsec. The scaling was applied as follows: $C^\prime$ = $C/\zeta^2$ , $V^\prime$ = $V/\zeta^4$ , and $W^\prime$ = $W/\zeta^2$ . This adjustment ensures that the scaled cubes accurately reflect the changes introduced by the smaller drop size, preserving the consistency of flux, variance, and weight across the data cube.

Each spaxel in the data cube is associated with a variance value that quantifies the uncertainty in the flux at that spatial and spectral position. This variance primarily reflects the combined contribution of Poisson noise from the object and sky, read noise, and uncertainties propagated through the flat-fielding, wavelength calibration, and sky subtraction steps. Although the individual components of the error budget are not explicitly separated, their cumulative effect is empirically propagated through the pipeline. This approach follows the method used in the SAMI pipeline (Sharp et al. Reference Sharp2015; Allen et al. Reference Allen2015). Similar strategies for empirical variance propagation have also been adopted and validated in other IFS surveys, including CALIFA (Husemann et al. Reference Husemann2013; García-Benito et al. Reference García-Benito2015) and MaNGA (Law et al. Reference Law2016).

In addition to per-spaxel variance, drizzle resampling introduces inter-spaxel covariance due to the partial overlap of fibre footprints on the output grid. This covariance affects the interpretation of spatially resolved parameter maps, and should be considered when fitting smooth models (e.g. velocity fields, stellar population gradients) or integrating over multiple spaxels. While variance provides local uncertainty estimates, the inter-spaxel covariance contributes to the total uncertainty in quantitative analyses.

The covariance is estimated during cube reconstruction, following the implementation in Section 5.7 of Sharp et al. (Reference Sharp2015), and is stored as a 5D array indexed by spatial coordinates (x,y), relative spatial offsets (dx,dy), and wavelength slice. The use of a larger drop size is expected to increase the degree of covariance due to greater overlap in the resampling process (Fruchter & Hook Reference Fruchter and Hook2002). Under the simplifying assumption of uniform sampling, the spatial covariance in a drizzled data cube is expected to scale with the square of the drop size (i.e. $\propto \zeta^2$ ), reflecting the increasing overlap of fibre footprints in the resampling process. Accordingly, increasing the drop size from 0.5 to 0.75 should increase the inter-spaxel covariance by a factor of $(0.75/0.5)^2 = 2.25$ . To verify this, we compare the covariance structures estimated from example stellar cubes reconstructed with drop sizes of 0.5 and 0.75. For each cube, we perform a median combine along the wavelength axis to construct a 4D representation of the spatial covariance structure. We then calculate the average covariance for the eight immediately adjacent neighbours in both cubes. The resulting covariance ratios ( $Covar_{75}$ / $Covar_{50}$ ) are shown in Figure 16, and demonstrate a consistent increase in covariance for the larger drop size, with the ratios lying in the range 1.43–2.63, which is broadly consistent with the theoretical expectation of a $\zeta^2$ scaling.

Figure 16. Ratio of median covariance values between data cubes reconstructed with drizzle drop sizes of 0.75 and 0.5. Each pixel represents the average covariance ratio ( $Covar_{75}$ / $Covar_{50}$ ) between a central spaxel and its surrounding neighbour at a given spatial offset $(\Delta x, \Delta y)$ . The observed enhancement in covariance for the larger drop size is broadly consistent with the expected $\zeta^2 = 2.25$ scaling from drizzle resampling.

Unlike SAMI hexabundles, which have a fixed 61 fibres per bundle and produce cubes with uniform 50 by 50 spaxels, Hector features bundles with varying fibre counts, ranging from 37 to 169, offering greater spatial coverage of targeted galaxies. As a result, the spatial size of Hector cubes varies depending on the bundle configuration, while the x and y dimensions remain consistent, with each spaxel uniformly sized at $0.5 \times 0.5$ arcsec. Despite the variation in size, the target is always centred within the cube, and its nominal coordinates are assigned using the WCS. Cubes from AAOmega bundles (A–H) contain 2 048 wavelength slices, identical to SAMI cubes, whereas cubes from Spector bundles (I–U) contain 4 096 wavelength slices, enabling finer spectral resolution.

In addition to the default cubes, we produced binned cubes using three binning schemes implemented in the SAMI data reduction pipeline (Allen et al. Reference Allen2014): adaptive binning based on the Voronoi method (Cappellari & Copin Reference Cappellari and Copin2003), annular binning into five elliptical annuli, and sector binning, which further subdivides the annuli azimuthally into equal-area regions. For all binning schemes, the variance of each binned spectrum was calculated by propagating the individual spaxel variances and applying a wavelength-dependent correction factor derived from the covariance. This correction accounts for the increased noise resulting from correlated spaxels within each bin, and is computed using the relative spatial offsets of spaxels and their associated covariance maps.

3. Verification of early science data

In this section, we highlight the data quality and key features of the Hector early science data set, which comprises observations of 1 539 unique galaxies collected between April 2023 and October 2024, in support of early science studies.

3.1 S/N distribution

In Figure 17, we show the fraction of spaxels with S/N $\gt$ 5, within the effective radius, $R_\textrm{e}$ as a function of the surface brightness within the effective radius, $\mu_\textrm{e}$ , in r-band. The S/N per Å is calculated as the median flux divided by the square root of the variance, and normalised by the square root of the spectral dispersion (in Å). Since galaxy brightness contributes to the signal, S/N is partially correlated with $\mu_{e}$ . 62% of the sample have more than 90% of spaxels with S/N $\gt$ 5 within 1 $R_\textrm{e}$ , and 51% of galaxies reach 100%. With S/N $\gt$ 3, 77% of galaxies contain more than 90% of spaxels within 1 $R_\textrm{e}$ , and 67% of galaxies reach 100%. The higher number of galaxies in Spector compared to AAOmega is mainly due to the larger number of bundles in Spector. The majority of our sample provides sufficiently high S/N within 1 $R_\textrm{e}$ , ensuring reliable data quality for subsequent analysis.

Figure 17. The fraction of spaxels with S/N $\gt$ 5 within one effective radius as a function of the surface brightness within one effective radius ( $\mu_{e}$ ) in r-band. The histogram shows the distribution of the fraction. The filled and open circles and histograms are the galaxies observed from AAOmega and Spector, respectively.

Figure 18. PSF FWHM distribution measured from secondary standard stars, comparing AAOmega (dashed line) and Spector (solid line). This result confirms that the spatial resolution remains consistent between the two instruments, without artificial discrepancies introduced by the instrumentation.

Figure 19. The distribution (red points) of R.A. and Dec. offsets between Hector and Legacy Survey DR10. The blue open circle with an error bar is the median offset and its associated error (standard error based on MAD); values are shown in the lower left of the main panel. The top and right histograms and the blue dashed lines show the distributions and the medians for R.A. and Dec., respectively. The solid black lines are centred at zero for all panels. Each open black circle encloses the labelled fraction of galaxies (50%, 90%, and 95%).

3.2 Spatial resolution

For Hector, each tile configuration includes two dedicated secondary standard star bundles, one assigned to AAOmega and the other to Spector. We estimated the spatial resolution of galaxy cubes in each tile by measuring the PSF of secondary standard star cubes, which were simultaneously observed and processed into cubes in the same manner as the corresponding galaxy data.

Figure 20. The distribution of absolute misalignments between the position angles (PAs) estimated with MGEFit’s find_galaxy subroutine. The dark blue solid and dashed lines represent $\pm$ RMS and $\pm$ 2RMS. The inset panel presents the absolute misalignments as a function of ellipticity estimated for Legacy images. The blue vertical line is the lower limit we adopted for this analysis, and the red points highlight the cubes satisfying this criterion.

In Figure 18, we present the PSF distribution measured from 267 secondary standard star cubes included in the early science data, which also serves as a proxy for the spatial resolution of the galaxy cubes. For each stellar cube, we collapse the data into a 2D image and fit a Moffat profile to measure the PSF FWHM, yielding a median FWHM of 2.02 arcsec. We do not detect any significant differences in FWHM between AAOmega and Spector stellar cubes, indicating that the spatial resolution is consistent across both instruments.

Figure 21. Comparison of the AAOmega (purple) and Spector (green) spectra (integrated within 1.5 kpc, corresponding to 3.1 arcsec) for a Hector galaxy observed with both spectrographs (W43690869503589: RA = $42.9344^{\mathrm{o}}$ , DEC = $-31.4842^{\mathrm{o}}$ , $z=0.023$ ). The blue and red arms are shown in the top and bottom panels, respectively, showcasing the continuous coverage of Spector data across the full wavelength range, compared to the incomplete coverage of AAOmega. The PSF FWHM of the AAOmega and Spector observations are 2.34 and 1.84 arcsec, respectively, accounting for the systematic offset in flux between the two data sets. The $[\mathrm{OII}]$ , $\mathrm{H}\unicode{x03B2}$ , [NII], $\mathrm{H}\unicode{x03B1}$ , and [SII] emission lines are labelled, together with the NaD absorption line (present only in the Spector data). Inset panels show the wavelength ranges around these features; the background on the insets matches the highlighted regions for these features on the main diagrams.

3.3 Spectral resolution

We derive the FWHM of the spectral instrumental line spread function (LSF) by fitting Gaussians to arc lines in a total of 1997 Helium-CuAr-FeAr arc frames, taken between August 2022 and December 2024. For each optical fibre, we only fit unsaturated and unblended arc lines that account for 15, 19, 39, and 30 arc lines on CCDs 1 through 4, respectively. All results are combined into a three-dimensional array with wavelength, fibre number and observation date. To obtain the FWHM as a function of any one dimension, we collapse the array along the two other dimensions using a median. There are four different LSFs based on dependencies including CCD, hexabundle, wavelength, and hexabundle-wavelength. The effects of the LSFs on the stellar kinematics measurements will be investigated and outlined by Tuntipong et al. (in preparation).

We summarise the Hector spectral resolution at the central wavelengths in Table 1. We also compare the FWHMs of the AAOmega CCDs from the Hector survey with those from the SAMI survey. For CCD1 and CCD2, the FWHMs in the Hector survey are 2.55 and 1.52 Å, respectively, while those in the SAMI survey are 2.66 and 1.59 Å, respectively (Scott et al. Reference Scott2018). Furthermore, the FWHMs of the Spector CCDs are markedly smaller than the AAOmega CCDs, i.e. 1.40 and 1.20 Å in CCD3 and CCD4, respectively. Overall, Hector delivers a significant improvement in spectral resolution relative to SAMI.

3.4 WCS and orientation accuracy

Section 2.5 outlines the centring of Hector cubes. We assess the centring accuracy via cross-correlation between reconstructed Hector images and Legacy Survey g-band images (Dey et al. Reference Dey2019). Mock g-band images are generated from the Hector cubes using the DECam g-band filter response (Flaugher et al. Reference Flaugher2015). Legacy images are matched to Hector’s pixel resolution (0.5 $^{\prime\prime}$ /pix) and convolved with the PSF. A mask excludes regions outside the hexabundle to prevent contamination. Figure 19 presents the R.A. and Dec. offsets, with median values of 0.032 arcsec in R.A. and 0.022 arcsec in Dec. These measured centring offsets are consistent with the level expected from statistical fluctuations, based on the number of spaxels used in the centroiding. A total of 2 270 cubes ( $\sim 96\%$ ) have total offsets, defined as $R=\sqrt{(\Delta\alpha)^2 + (\Delta\delta)^2}$ , below 1 arcsec. Among the 102 remaining cubes, 52 are miscentred due to nearby objects (within the hexabundle) or incorrect coordinates, or incorrect orientation. Cross-correlation fails for 50 cubes, primarily due to galaxies faint in blue cubes or not fully imaged with the Legacy Survey. However, visually, the faint and missing-imaging cubes appear well-centred and were flagged as accurate. Overall, $\sim$ 97% of the cubes have accurate centring.

Figure 22. A kinematically twisted barred spiral galaxy (survey ID: C901005167309223) observed in one of the largest bundles (B) in AAOmega. Top row: From left to right, the log median flux from the blue cube, log median flux from the red cube, $\mathrm{H}\unicode{x03B2}$ and $\mathrm{H}\unicode{x03B1}$ emission line log flux, with lighter colours indicating higher fluxes. Middle row: The stellar velocity and velocity dispersion, the gas velocity and velocity dispersion, all in km s $^{-1}$ with accompanying colour bars in the lower left corner. For both the stellar and gas velocity, the median of the central $5\times5$ spaxels was subtracted from the velocity maps. Bottom row: Typical diagnostic ratios. From left to right, $\log($ [NII]/ $\mathrm{H}\unicode{x03B1})$ , $\log($ [OIII]/ $\mathrm{H}\unicode{x03B2})$ , and Balmer decrement. The bottom right panel is an optical image from the Legacy Survey DR9 (Dey et al. Reference Dey2019) with the hexabundle diameter (25.9 arcsec) shown by the red contour. The bar-like structure in the $\sigma_\textrm{gas}$ map is a kinematic feature aligned with the gas rotation axis and reflects non-circular motions or beam smearing near steep velocity gradients, rather than the stellar bar seen in the imaging and $\mathrm{H}\alpha$ -flux panels.

In SAMI, hexabundles were plugged to have fixed orientations across the plate. However, the Hector plate has three exits allowing us to plug the circular/rectangular magnets and hence hexabundles at any angle as shown in Figure 4. This means that the hexabundles (and therefore the galaxy data) will have a range of rotations relative to the telescope’s reference frame. Therefore, the orientations must be standardised across the plate to be North up and East left, by reverting the rotation. The orientation correction is done based on the input Hector robot files.

As an indirect way to test this, we examine cube orientations by comparing position angles (PA) estimated using the find_galaxy subroutine of the MGEFit code (Cappellari Reference Cappellari2002) for both reconstructed Hector images and Legacy Survey g-band images. To account for PA symmetry, we define the smallest absolute misalignment as $|\Delta\text{PA}| = \text{min}(|\Delta\text{PA}|, 180 -|\Delta\text{PA}|)$ , constraining $|\Delta\text{PA}|$ to [0, 90] degrees. The inset panel in Figure 20 shows an apparent trend of misalignment with ellipticity, but this is due to the larger uncertainties on PA for rounder objects, which means galaxies with lower ellipticity tend to have large PA differences due to unconstrained PA. Focusing on well-centred cubes with Legacy imaging, we applied a cut based on the ellipticities of Legacy images to exclude very round objects ( $\varepsilon_{Legacy}\leq0.15$ ), yielding 1 692 cubes for analysis. The main panel of Figure 20 presents the distribution of misalignment. The RMS misalignment is $\sim$ 7 degrees (solid dark blue line), with 90% and 97% of this sample aligned to within $\pm$ RMS and $\pm$ 2RMS, respectively. The misalignment arises from similar issues described in the previous paragraph for centring (i.e. problems with the data rather than the orientation correction being wrong).

Figure 23. A counter-rotating galaxy (ID: W183970774910266) observed in bundle P (diameter 15.5 arcsec) of Spector. The panels are the same as Figure 22.

3.5 Example data

3.5.1 Spectra

In Figure 21, we show a comparison between the AAOmega and Spector integrated spectra (within 1.5 kpc, corresponding to 3.1 arcsec) for a galaxy observed with both instruments (ID:W43690869503589). The small flux offset between the two spectra originates from differences in seeing conditions between the two observations: the AAOmega spectrum was taken with a PSF FWHM of 2.34 arcsec, which results in a fainter flux within a small aperture compared to the Spector spectrum, which was observed with a FWHM of 1.8 arcsec (see Figure 8).

Figure 21 shows the continuous wavelength coverage of Spector, highly desirable for full spectral fitting, that is not available for AAOmega data. In particular, Spector data covers the wavelength range 5 787–6 296 Å, a region that is not sampled by AAOmega. This part of the galaxy’s spectrum includes the NaD absorption line doublet (highlighted in the orange inset panel in Figure 21), a strong indicator of neutral gas and a useful tracer of galactic inflows and outflows.

Spectra from Spector exhibit significantly sharper emission lines than those from AAOmega, particularly in the blue arm, where the spectral resolution is higher by a factor of approximately 1.8 (compared to 1.3 in the red; see Table 1). This improvement is clearly demonstrated in the $\mathrm{H}\beta$ emission line, which appears much sharper in Spector data than in AAOmega. In contrast, the difference is less pronounced in $\mathrm{H}\alpha$ , where both instruments provide comparably high resolution. Spector data also resolve the [OII] $\lambda\lambda$ 3726, 3729 doublet, which appears blended in the AAOmega spectra. This enables more accurate flux measurements of the individual [OII] doublet lines, which is particularly important for electron density diagnostics in the H ii region. In addition, the improved resolution facilitates the study of complex emission line profiles. While such features (e.g. emission line profiles characterised by multiple Gaussian components) have been identified in AAOmega spectra taken for the SAMI survey, it was pointed out by Zovaro et al. (Reference Zovaro2024) that the lower resolution of AAOmega in the blue arm results in unreliable flux measurements for emission lines in this wavelength range (most notably, $\mathrm{H}\beta$ and [OIII]).

3.5.2 Kinematic and emission-line maps

To demonstrate the quality of the Hector data, we show three example galaxies that are kinematically interesting, one from AAOmega (Figure 22) and two from Spector (Figures 23 and 24). For each galaxy, we display maps of the continuum, $\mathrm{H}\alpha$ and $\mathrm{H}\unicode{x03B2}$ flux maps, stellar and gas kinematics, and typical diagnostic flux ratios. The stellar kinematics are fitted using pPXF with a Gaussian LOSVD and 12th-order additive Legendre polynomial, similar to the method in SAMI van de Sande et al. (Reference van de Sande2017). The stellar continuum is fitted using the kinematics and a 12th-order multiplicative polynomial, which is subtracted from the spectrum so that a multi-Gaussian component emission line fit can be performed. For simplicity, only single-component Gaussian fits are shown here. A more detailed description of the pipeline used to generate these products will be provided in Quattropani et al. (in preparation).

Figure 24. A galaxy with a kinematically decoupled core (ID: W42700250208413) observed in bundle N (diameter 15.5 arcsec) of Spector. The panels are the same as Figure 22.

C901005167309223 (Figure 22) is a massive barred galaxy observed with one of the largest AAOmega bundles, with a diameter of 25.9 arcsec. The galaxy exhibits a mild kinematic twist in both the stellar and ionised gas velocity fields, which can only be detected with such extended spatial coverage. Low [N ii]/H $\alpha$ ratios and the Balmer decrement trace a star-forming ring, while elevated central line ratios suggest the presence of a active galactic nucleus (AGN) or shock ionisation.

W183970774910266 (Figure 23) is a low-mass star-forming galaxy with a stellar mass of $\log(M_*/M_{\odot}) = 8.95$ . The high spectral resolution of Spector enables reliable kinematic measurements even in such low-mass systems. The kinematic maps reveal a clear counter-rotation between the ionised gas and stars, although the stellar velocity field appears more irregular and only weakly rotating.

W42700250208413 (Figure 24) is an intermediate-mass early-type galaxy observed with Spector. The gas velocity map shows regular rotation, while the stellar velocity field reveals a prominent kinematically decoupled core (KDC), with the inner region rotating in a direction misaligned with the outer stellar body. The stellar velocity dispersion map also displays two off-centre peaks, indicative of a complex assembly history for this galaxy.

4. Summary and conclusions

In this paper we present the data reduction process and validation tests for the Hector Galaxy Survey. Our data reduction pipeline, adapted from the SAMI Galaxy Survey, incorporates significant modifications to accommodate the complexities of the Hector instrument, which features four CCDs fed by two distinct spectrographs, AAOmega and Spector. These spectrographs differ in detector format, spectral resolution, fibre bundle sizes, wavelength coverage, and throughput efficiency, necessitating tailored approaches for data processing and calibration.

We highlight some new and enhanced features of Hector Galaxy Survey data:

• Two-dimensional wavelength calibration: We introduced a 2D arc-fitting approach that delivers more accurate wavelength solutions across the entire detector, significantly enhancing spectral resolution and reducing systematic errors compared to per-fibre methods. This method yields RMS velocity scatter values of 2.7, 1.3, 1.2, and 1.9 km s $^{-1}$ for CCDs 1 through 4, respectively, corresponding to improvements by a factor of 1.2–3.4 relative to per-fibre fitting.
• Chromatic variation in distortion corrections: We constructed a 3D map of the chromatic variation in distortion across the 2-degree field of view of Hector as a function of plate position (robotic x- and y-coordinates) and wavelength, using stellar observations taken during the instrument commissioning. The distortion is modelled using a polynomial function of the field radius ( $\alpha$ ), incorporating odd-power terms up to $\alpha^7$ to characterise its variation across the plate and a quadratic function to parameterise the wavelength dependence. This model is integrated into the data reduction pipeline, where it is applied to the extraction of primary and secondary standard stars, the alignment of dithered frames, and the extraction of galaxy data to generate spectral cubes.
• Cubing drop size: We evaluated different drop sizes for the drizzle-like cubing algorithm, balancing the trade-off between spatial resolution and S/N per spaxel. We adopt a 1.2 arcsec (75%) drop size, achieving a 30% gain in S/N with only a 4% increase in FWHM under typical Hector observing conditions.
• Higher spectral resolution and wavelength coverage: The Spector spectrograph offers higher spectral resolution (1.4 Å in the blue arm and 1.2 Å in the red arm) compared to AAOmega (2.55 and 1.52 Å; see Table 1). This enhancement enables more precise kinematic and emission-line studies, particularly benefitting research on low-mass galaxies and cold stellar disks. Furthermore, the Spector data offers broader wavelength coverage (3 750–7 800 Å) compared to AAOmega, which covers 3 750–5 750 Å and 6 300–7 400 Å, as shown in Section 2.3.5 and Figure 21. Spector’s wider, and continuous, spectral coverage samples the NaD absorption line doublet, not available in AAOmega data.

This paper presents examples demonstrating the excellent quality of Hector data and its reach and power for enabling a wide range of science. The examples provided showcase Hector’s higher spectral resolution, broad wavelength coverage, and improved spatial sampling, offering critical insights into galaxy kinematics, stellar populations, and emission-line diagnostics. These data illustrate the capabilities of the current data reduction pipeline and provide a promising foundation for future extragalactic and astrophysical science enabled by Hector.

Acknowledgements

The Hector Galaxy Survey is based on observations made at the Anglo-Australian Telescope. We acknowledge the traditional owners of the land on which the AAT stands, the Gamilaraay people, and pay our respects to elders past and present.

We extend our sincere thanks to the staff at Siding Spring Observatory for their unwavering dedication, expertise, and consistent commitment during the commissioning and operational phases of the Hector instrument and for their continued expertise, time and hard work in maintaining the instrument and supporting the ongoing Hector Galaxy survey. This project would not have been successful without the tireless efforts of many individuals, including Ian Adams, Ashley Anderson, Nadim El-Saleh, Gerard Hutchinson, Chris Lidman, Glen Murphy, Murray Riding, and Zachariah Smith.

The Hector multi-object integral field spectrograph instrument was built jointly by the University of Sydney and Macquarie University nodes of the Astralis Astronomical Instrumentation Consortium (https://astralis.org.au/), with additional financial contributions from the Australian National University and University of Western Australia and support from the Australian Research Council through grants LE170100242, LE190100018 and FT180100231. The Hector input catalogue is based on data taken from the WAVES Survey, Sloan Digital Sky Survey, GAMA Survey, 2dF Galaxy Redshift Survey, and Skymapper Southern Sky Survey. The Hector Galaxy Survey website is https://hector.survey.org.au/. The Hector Galaxy Survey makes use of Data Central services (datacentral.org.au). The authors acknowledge the use of computing resources provided by Sukyoung Yi at Yonsei University for data processing and analysis.

Data availability statement

The data used in this study were obtained from the Hector Galaxy Survey and are currently proprietary. The data will be made publicly available in an upcoming data release.

Author contributions

SO and MLPG led the Hector Data Reduction (DR) Working Group, developed the DR pipeline, and devised the project. SMC, GQ, ST, JJB, PC, PKD, OÇ, JHL, AR, SB, MP, SMS, TJW, TR, YM, and MSO contributed to the pipeline’s development through data analysis, quality control, and verification. JJB leads the Hector Galaxy Survey and led the build of the Hector instrument. Most authors contributed through observations for the Hector Galaxy Survey and/or participation in instrument development. All authors reviewed and provided feedback on the manuscript.

Funding statement

The Hector Galaxy Survey research is supported by the Australian Research Council Centre of Excellence for All Sky Astrophysics in 3 Dimensions (ASTRO3D), through project number CE170100013, and other participating institutions. SO acknowledges support from the Korean National Research Foundation (NRF) (RS-2023-00214057; RS-2025-00514475), as well as ongoing support from DL. MLPG acknowledges support from the ARC grant DP190102714. JHL acknowledges support from the Korea Astronomy and Space Science Institute under the R&D program (Project No. 2025-1-831-01), supervised by the Korea AeroSpace Administration, and from the National Research Foundation of Korea (NRF) grant funded by the Korea government (MSIT) (No. 2022R1A2C1004025). CF is the recipient of an Australian Research Council Future Fellowship (project number FT210100168) and Discovery Project DP210101945 funded by the Australian Government. JC acknowledges support from the Basic Science Research Program through the National Research Foundation (NRF) of Korea (2022R1F1A107287) and Global-LAMP Program of the National Research Foundation of Korea (NRF) grant funded by the Ministry of Education (No. RS-2023-00301976). JJB acknowledges funding from the Australia Research Council through grant FT180100231. KG is supported by the Australian Research Council through the Discovery Early Career Researcher Award (DECRA) Fellowship (project number DE220100766) funded by the Australian Government. KO acknowledges support from the National Research Foundation of Korea (NRF) grant funded by the Korea government (MSIT) (RS-2025-00553982). MMC acknowledges support from a Royal Society Wolfson Visiting Fellowship (RSWVF\R3\223005) at the University of Oxford. SB acknowledges the support from the Physics Foundation through the Messel Research Fellowship. SMS acknowledges funding from the Australian Research Council (DE220100003). YM and GQ are supported by an Australian Government Research Training Program (RTP) Scholarship. AR acknowledges that this research was carried out while the author was in receipt of a Scholarship for International Research Fees (SIRF) at The University of Western Australia. OC is supported by an Australian Government Research Training Program Scholarship for international graduate research students (iRTP). ST acknowledges the support from the Royal Thai Government Scholarship and the University of Sydney Postgraduate Research Supplementary Scholarship.

Footnotes

^a https://www.aao.gov.au/science/software/2dfdr; see also http://www.ascl.net/1505.015.

^b https://snfactory.lbl.gov/snf/snf-specstars.html.

^c https://classic.sdss.org/dr7/products/spectra/spectrophotometry.php.

^d https://population-synthesis-toolkit.readthedocs.io/en/latest/

References

AAO Software Team. 2015, 2dfdr: Data reduction software, Astrophysics Source Code Library, ascl:1505.015Google Scholar

Aldering, G., et al. 2002, in Society of Photo-Optical Instrumentation Engineers (SPIE) Conference Series, Vol. 4836, Survey and Other Telescope Technologies and Discoveries, ed. Tyson, J. A., & Wolff, S., 61Google Scholar

Allen, J. T., et al. 2014, SAMI: Sydney-AAO Multi-object Integral field spectrograph pipeline, Astrophysics Source Code Library, ascl:1407.006Google Scholar

Allen, J. T., et al. 2015, MNRAS, 446, 1567Google Scholar

Bacon, R., et al. 2001, MNRAS, 326, 23Google Scholar

Bland-Hawthorn, J., et al. 2011, OEx, 19, 2649Google Scholar

Brown, R., Wang, A. H., Bryant, J. J., & Leon-Saval, S. 2018, in Society of Photo-Optical Instrumentation Engineers (SPIE) Conference Series, Vol. 10706, Advances in Optical and Mechanical Technologies for Telescopes and Instrumentation III, ed. Navarro, R., & Geyl, R., 1070663Google Scholar

Bryant, J. J., et al. 2015, MNRAS, 447, 2857Google Scholar

Bryant, J. J., et al. 2024, in SPIE Conference Series, Vol. 13096, Ground-based and Airborne Instrumentation for Astronomy X, ed. Bryant, J. J., Motohara, K., & Vernet, J. R. D., 130960DGoogle Scholar

Bryant, J. J., Bland-Hawthorn, J., Fogarty, L. M. R., Lawrence, J. S., & Croom, S. M. 2014, MNRAS, 438, 869Google Scholar

Bundy, K., et al. 2015, ApJ, 798, 7Google Scholar

Calzetti, D., et al. 2000, ApJ, 533, 682Google Scholar

Cappellari, M. 2002, MNRAS, 333, 400Google Scholar

Cappellari, M., et al. 2011, MNRAS, 413, 813Google Scholar

Cappellari, M. 2016, ARA&A, 54, 597Google Scholar

Cappellari, M. 2017, MNRAS, 466, 798Google Scholar

Cappellari, M., & Copin, Y. 2003, MNRAS, 342, 345Google Scholar

Cappellari, M., & Emsellem, E. 2004, PASP, 116, 138Google Scholar

Childress, M. J., Vogt, F. P. A., Nielsen, J., & Sharp, R. G. 2014, Ap&SS, 349, 617Google Scholar

Conroy, C. 2013, ARA&A, 51, 393Google Scholar

Corcho-Caballero, P., Ascasibar, Y., & Jiménez-López, D. 2025, JOSS, 10, 8203Google Scholar

Croom, S. M., et al. 2012, MNRAS, 421, 872Google Scholar

Croom, S. M., et al. 2021, MNRAS, 505, 991Google Scholar

Dey, A., et al. 2019, AJ, 157, 168Google Scholar

Driver, S. P., et al. 2016, in Astrophysics and Space Science Proceedings, Vol. 42, The Universe of Digital Sky Surveys, ed. Napolitano, N. R., Longo, G., Marconi, M., Paolillo, M., & Iodice, E., 205Google Scholar

Flaugher, B., et al. 2015, AJ, 150, 150Google Scholar

Foster, C., et al. 2021, PASA, 38, e031Google Scholar

Fruchter, A. S., & Hook, R. N. 2002, PASP, 114, 144Google Scholar

García-Benito, R., et al. 2015, A&A, 576, A135Google Scholar

García-Benito, R., et al. 2024, A&A, 691, A161Google Scholar

Green, A. W., et al. 2018, MNRAS, 475, 716Google Scholar

Husemann, B., et al. 2013, A&A, 549, A87Google Scholar

Kaur, G., Bilicki, M., Hellwing, W., & The WAVES Team. 2025, arXiv e-prints, arXiv:2502.20983 Google Scholar

Kausch, W., et al. 2015, A&A, 576, A78Google Scholar

Kewley, L. J., & Ellison, S. L. 2008, ApJ, 681, 1183Google Scholar

Koekemoer, A. M., et al. 2011, ApJS, 197, 36Google Scholar

Kurucz, R. L. 1992, in IAU Symposium, Vol. 149, The Stellar Populations of Galaxies, ed. B. Barbuy & A. Renzini, 225Google Scholar

Law, D. R., et al. 2015, AJ, 150, 19Google Scholar

Law, D. R., et al. 2016, AJ, 152, 83Google Scholar

Law, D. R., et al. 2021, AJ, 161, 52Google Scholar

Lewis, I. J., et al. 2002, MNRAS, 333, 279Google Scholar

Pedregosa, F., et al. 2011, JMLR, 12, 2825Google Scholar

Sánchez, S. F. 2006, AN, 327, 850Google Scholar

Sánchez, S. F., et al. 2012, A&A, 538, A8Google Scholar

Sánchez, S. F. 2020, ARA&A, 58, 99Google Scholar

Scott, N., et al. 2018, MNRAS, 481, 2299Google Scholar

Sharp, R., et al. 2006, in Society of Photo-Optical Instrumentation Engineers (SPIE) Conference Series, Vol. 6269, Ground-based and Airborne Instrumentation for Astronomy, ed. I. S. McLean & M. Iye, 62690GGoogle Scholar

Sharp, R., et al. 2015, MNRAS, 446, 1551Google Scholar

Smette, A., et al. 2015, A&A, 576, A77Google Scholar

van de Sande, J., et al. 2017, ApJ, 835, 104Google Scholar

van Dokkum, P. G. 2001, PASP, 113, 1420Google Scholar

Walcher, J., Groves, B., Budavári, T., & Dale, D. 2011, Ap&SS, 331, 1Google Scholar

Wang, A. H., Brown, R., Bryant, J. J., & Leon-Saval, S. 2019, in Society of Photo-Optical Instrumentation Engineers (SPIE) Conference Series, Vol. 11115, UV/Optical/IR Space Telescopes and Instruments: Innovative Technologies and Concepts IX, ed. Barto, A. A., Breckinridge, J. B., & Stahl, H. P., 1111509Google Scholar

Wang, A. H., Brown, R., Bryant, J. J., & Leon-Saval, S. 2020, in Society of Photo-Optical Instrumentation Engineers (SPIE) Conference Series, Vol. 11447, Ground-based and Airborne Instrumentation for Astronomy VIII, ed. Evans, C. J., Bryant, J. J., & Motohara, K., 114478GGoogle Scholar

Wang, A. H., Brown, R., Bryant, J. J., & Leon-Saval, S. 2023, MNRAS, 522, 4310Google Scholar

Wisnioski, E., et al. 2015, ApJ, 799, 209Google Scholar

Yan, R., et al. 2016, AJ, 152, 197Google Scholar

Zovaro, H. R. M., et al. 2024, MNRAS, 527, 8566Google Scholar

Table 1. A summary of Hector spectral resolution at the central wavelengths $\lambda_{central}$. This table provides data for all four CCDs including wavelength coverage ($\lambda_{range}$) in Å, central wavelength $\lambda_{central}$ in Å, median FWHM of best-fit Gaussian to the instrumental LSF (FWHM) in Å, median standard deviation of the Gaussian fit ($\sigma$) in Å, spectral resolution at $\lambda_{central}$ ($R_{\lambda_{central}}$), velocity resolution (FWHM) in km s$^{-1}$, and dispersion resolution ($1\sigma$) in km s$^{-1}$.

Figure 1. The result of 2D wavelength calibration for an example arc frame from CCD3 (frame 19, 28 October 2024). (a) Histogram of residuals from the model fit, defined as (measured arc line wavelength) – (model wavelength). The dotted lines mark $\pm0.1$ pixels on the detector. (b) Residuals across the detector. (c) Residuals as a function of detector x pixel (i.e. the vertical collapse of panel (b)). Small red points are individual line measurements, various coloured connected points are locally averaged residuals in a 10$\times$10 grid across the detector. (d) Small red points are residuals as a function of y detector pixel (i.e. the horizontal collapse of panel (b)). Coloured points are average residuals, as for (c).

Figure 4. Stellar observations illustrating the effects of chromatic variations in distortion (CVD) are shown as a function of wavelength and position across the Hector plate, presented in the coordinate system used by the Hector robot. Black-filled circles mark the stellar centroid at a reference wavelength of 6 000 Å, while coloured points trace the shift in the centroids of stellar observations across wavelength, shifting from redder to bluer wavelengths (red-to-blue filled-in circles) relative to the centroid at the reference wavelength. For clarity, the centroid shifts due to CVD effects are exaggerated by a factor of 20; the maximum shift is $\sim$120 $\unicode{x03BC}$m (1.17 times the fibre core diameter). For several hexabundles, we also illustrate the hexabundle orientation and cable direction (see Section 3.4 for discussion on the orientation of hexabundles and associated corrections). Grey lines connect the physical centres of each hexabundle to the centre of the Hector plate.

Figure 5. Modelling the Chromatic Variation in Distortion across the Hector plate. (a) Distortion as a function of position along the Plate y-coordinate across the Hector plate, from left-to-right, as shown in Figure 4. Also, as in Figure 4, the colour gradient from blue to red represents measured centroid offsets as a function of wavelength. The modelled distortion at wavelengths of 3 730 and 7 330 Å is shown as solid blue and red lines, respectively. (b) Residuals between the model and observed distortions at 3 800, 5 000, and 7 200 Å, demonstrating that the model effectively reproduces the measured distortions across the Hector plate to approximately within $\pm 10 \unicode{x03BC}$m. (c) RMS of the residuals as a function of radius on the Hector plate, with colours indicating increasing wavelength from blue to red. (d) RMS of the residuals as a function of wavelength, illustrating that RMS progressively becomes larger towards bluer wavelengths.

Figure 7. Sky to detector throughput achievable by Hector in the best conditions. Spector shows significantly higher throughput in both the blue and red arms relative to AAOmega.

Figure 13. Statistics of the blue-red flux difference, using the blue- and red-arm spectra integrated within a central 3-arcsec radius in the data cube for each galaxy observed using Spector. (a) Blue-red flux difference in percentage for all Spector galaxy spectra without a S/N cut. The number of blue-red spectra sets is given in parentheses. Note that the number of spectra exceeds the number of galaxies because some galaxies were observed multiple times. (b) Spector galaxy spectra with S/N $\geq$ 10. The values at the bottom show the median $\pm$ half the range between 16 and 84 percentiles at each wavelength point ($\lambda$-point; as defined in Figure 12). (c) The same as (b), but the blue-red flux difference is divided by the propagated noise at each $\lambda$-point, not by the red-arm flux at 5 800 Å.

Figure 16. Ratio of median covariance values between data cubes reconstructed with drizzle drop sizes of 0.75 and 0.5. Each pixel represents the average covariance ratio ($Covar_{75}$/$Covar_{50}$) between a central spaxel and its surrounding neighbour at a given spatial offset $(\Delta x, \Delta y)$. The observed enhancement in covariance for the larger drop size is broadly consistent with the expected $\zeta^2 = 2.25$ scaling from drizzle resampling.

Figure 17. The fraction of spaxels with S/N $\gt$ 5 within one effective radius as a function of the surface brightness within one effective radius ($\mu_{e}$) in r-band. The histogram shows the distribution of the fraction. The filled and open circles and histograms are the galaxies observed from AAOmega and Spector, respectively.

Figure 20. The distribution of absolute misalignments between the position angles (PAs) estimated with MGEFit’s find_galaxy subroutine. The dark blue solid and dashed lines represent $\pm$RMS and $\pm$2RMS. The inset panel presents the absolute misalignments as a function of ellipticity estimated for Legacy images. The blue vertical line is the lower limit we adopted for this analysis, and the red points highlight the cubes satisfying this criterion.

Figure 21. Comparison of the AAOmega (purple) and Spector (green) spectra (integrated within 1.5 kpc, corresponding to 3.1 arcsec) for a Hector galaxy observed with both spectrographs (W43690869503589: RA = $42.9344^{\mathrm{o}}$, DEC = $-31.4842^{\mathrm{o}}$, $z=0.023$). The blue and red arms are shown in the top and bottom panels, respectively, showcasing the continuous coverage of Spector data across the full wavelength range, compared to the incomplete coverage of AAOmega. The PSF FWHM of the AAOmega and Spector observations are 2.34 and 1.84 arcsec, respectively, accounting for the systematic offset in flux between the two data sets. The $[\mathrm{OII}]$, $\mathrm{H}\unicode{x03B2}$, [NII], $\mathrm{H}\unicode{x03B1}$, and [SII] emission lines are labelled, together with the NaD absorption line (present only in the Spector data). Inset panels show the wavelength ranges around these features; the background on the insets matches the highlighted regions for these features on the main diagrams.

Figure 22. A kinematically twisted barred spiral galaxy (survey ID: C901005167309223) observed in one of the largest bundles (B) in AAOmega. Top row: From left to right, the log median flux from the blue cube, log median flux from the red cube, $\mathrm{H}\unicode{x03B2}$ and $\mathrm{H}\unicode{x03B1}$ emission line log flux, with lighter colours indicating higher fluxes. Middle row: The stellar velocity and velocity dispersion, the gas velocity and velocity dispersion, all in km s$^{-1}$ with accompanying colour bars in the lower left corner. For both the stellar and gas velocity, the median of the central $5\times5$ spaxels was subtracted from the velocity maps. Bottom row: Typical diagnostic ratios. From left to right, $\log($[NII]/$\mathrm{H}\unicode{x03B1})$, $\log($[OIII]/$\mathrm{H}\unicode{x03B2})$, and Balmer decrement. The bottom right panel is an optical image from the Legacy Survey DR9 (Dey et al. 2019) with the hexabundle diameter (25.9 arcsec) shown by the red contour. The bar-like structure in the $\sigma_\textrm{gas}$ map is a kinematic feature aligned with the gas rotation axis and reflects non-circular motions or beam smearing near steep velocity gradients, rather than the stellar bar seen in the imaging and $\mathrm{H}\alpha$-flux panels.

Figure 23. A counter-rotating galaxy (ID: W183970774910266) observed in bundle P (diameter 15.5 arcsec) of Spector. The panels are the same as Figure 22.

Figure 24. A galaxy with a kinematically decoupled core (ID: W42700250208413) observed in bundle N (diameter 15.5 arcsec) of Spector. The panels are the same as Figure 22.

Article contents

Hector Galaxy Survey: Data processing, quality control, and early science

Abstract

Keywords

Information

1. Introduction

2. Data processing and quality control

2.1 Data reduction

2.1.1 Overscan and bias corrections

2.1.2 Read noise and gain

2.1.3 Bad pixel mask and cosmic ray rejection

2.1.4 Extraction of spectra and removal of scattered light

2.1.5 Wavelength calibration

2.1.6 Flat fielding

2.1.7 Correcting fibre-to-fibre variations in throughput

2.1.8 Sky subtraction

2.2 Chromatic variation in distortion correction

2.2.1 Distortion dependence on field radius and wavelength

2.3 Flux calibration

2.3.1 Primary flux calibration

2.3.2 Sky to detector throughput estimation

2.3.3 Secondary flux calibration

2.3.4 Flux calibration stability

2.3.5 Overlap between blue- and red-arm spectra

2.4 Telluric correction

2.5 Cubing

3. Verification of early science data

3.1 S/N distribution

3.2 Spatial resolution

3.3 Spectral resolution

3.4 WCS and orientation accuracy

3.5 Example data

3.5.1 Spectra

3.5.2 Kinematic and emission-line maps

4. Summary and conclusions

Acknowledgements

Data availability statement

Author contributions

Funding statement

Footnotes

References

Save article to Kindle

Save article to Dropbox

Save article to Google Drive

Reply to: Submit a response

Your details

You have entered the maximum number of contributors

Conflicting interests