Class SamplingOptions (0.5.0)

SamplingOptions(
 max_download_size: typing.Optional[int] = 500,
 enable_downsampling: bool = False,
 sampling_method: typing.Literal["head", "uniform"] = "uniform",
 random_state: typing.Optional[int] = None,
)

Encapsulates configuration for data sampling.

Attributes

Name Description
max_download_size int, default 500
Download size threshold in MB. If value set to None, the download size won't be checked.
enable_downsampling bool, default False
Whether to enable downsampling, If max_download_size is exceeded when downloading data (e.g., to_pandas()), the data will be downsampled if enable_downsampling is True, otherwise, an error will be raised.
sampling_method str, default "uniform"
Downsampling algorithms to be chosen from, the choices are: "head": This algorithm returns a portion of the data from the beginning. It is fast and requires minimal computations to perform the downsampling.; "uniform": This algorithm returns uniform random samples of the data.
random_state int, default None
The seed for the uniform downsampling algorithm. If provided, the uniform method may take longer to execute and require more computation.

Except as otherwise noted, the content of this page is licensed under the Creative Commons Attribution 4.0 License, and code samples are licensed under the Apache 2.0 License. For details, see the Google Developers Site Policies. Java is a registered trademark of Oracle and/or its affiliates.

Last updated 2025年10月27日 UTC.