SEG-Y Descriptor: A Conceptual Overview

The SegyDescriptor is a structured model that defines the layout and content of a SEG-Y file. SEG-Y is a standard file format used in the geophysical industry for recording digital seismic data. In essence, the model serves as a blueprint for what a SEG-Y file should look like.

This class and its components provide a well-defined yet flexible way to work with SEG-Y seismic data files programmatically, from defining the file structure and read/write operations to customizing it for specialised use cases.

Conceptually, a SEG-Y Revision 0 file looks like this on disk:

┌──────────────┐  ┌─────────────┐  ┌────────────────────┐        ┌────────────────────┐
│ Textual File │  │ Binary File │  │      Trace 1       │        │      Trace N       │
│ Header 3200B │─►│ Header 400B │─►│ Header 240B + Data │─ ... ─►│ Header 240B + Data │
└──────────────┘  └─────────────┘  └────────────────────┘        └────────────────────┘
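The fixed sizes in the diagram make byte offsets straightforward to compute. The following is a minimal sketch, assuming fixed-length traces, no extended textual headers, and 4-byte samples. The function name trace_offset and its parameters are hypothetical and are not part of the segy package.

```python
TEXT_HEADER_BYTES = 3200    # textual file header
BINARY_HEADER_BYTES = 400   # binary file header
TRACE_HEADER_BYTES = 240    # header preceding each trace's samples


def trace_offset(index: int, samples_per_trace: int, bytes_per_sample: int = 4) -> int:
    """Return the byte offset of the header of trace `index` (0-based)."""
    if index < 0:
        raise ValueError("index must be non-negative")
    trace_bytes = TRACE_HEADER_BYTES + samples_per_trace * bytes_per_sample
    return TEXT_HEADER_BYTES + BINARY_HEADER_BYTES + index * trace_bytes


# The first trace starts right after the two file headers.
print(trace_offset(0, samples_per_trace=1000))  # 3600
# The second trace starts one full trace (240 + 1000 * 4 bytes) later.
print(trace_offset(1, samples_per_trace=1000))  # 7840
```

Files with variable-length traces or extended textual headers break this arithmetic, which is one reason a descriptor that spells out the layout is useful.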

Key Components

This descriptor model consists of several important components. Each of these components represents a particular section of a SEG-Y file.


SEG-Y Standard

The segy_standard attribute identifies the SEG-Y standard in use. SEG-Y files can follow different revisions or standards, including custom ones.

It must be set to one of the allowed SegyStandard values.

Text File Header

The text_file_header stores the information required to parse the textual file header of the SEG-Y file. The header itself holds human-readable metadata about the seismic data.
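To illustrate what parsing a textual header involves, here is a minimal sketch using only the Python standard library. It assumes the conventional layout of 40 rows by 80 columns (3200 bytes) and uses cp037, one of the EBCDIC code pages that Python ships. The function decode_text_header is hypothetical, and the encodings the library itself accepts are not shown here.

```python
ROWS, COLS = 40, 80  # 40 x 80 = 3200 bytes, the textual file header size


def decode_text_header(raw: bytes, encoding: str = "cp037") -> list[str]:
    """Decode a textual file header and split it into fixed-width rows."""
    if len(raw) != ROWS * COLS:
        raise ValueError(f"expected {ROWS * COLS} bytes, got {len(raw)}")
    text = raw.decode(encoding)
    return [text[i : i + COLS] for i in range(0, len(text), COLS)]


# Build a synthetic EBCDIC header so the example runs without a file.
lines = [f"C{n:2d} example line".ljust(COLS) for n in range(1, ROWS + 1)]
raw_header = "".join(lines).encode("cp037")

rows = decode_text_header(raw_header)
print(len(rows))          # number of rows recovered
print(rows[0].rstrip())   # first row without its padding
```

The rows, cols, and encoding values play the same role here as the fields of the same names on TextHeaderDescriptor.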

Binary File Header

The binary_file_header describes the binary file header of the SEG-Y file. This header holds structured information about the data in the file, stored in binary format so that machines can read and process it efficiently.

Binary headers are defined as StructuredDataTypeDescriptors and are built by specifying header fields in the StructuredFieldDescriptor format.
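The idea of building a header from field descriptions can be sketched with the standard library alone. The FieldSpec class and parse_header function below are hypothetical stand-ins and not the library's StructuredFieldDescriptor or StructuredDataTypeDescriptor. The three fields sit at the positions the SEG-Y standard assigns them (bytes 17-18, 21-22, and 25-26 of the binary header), and big-endian byte order is assumed.

```python
import struct
from dataclasses import dataclass


@dataclass(frozen=True)
class FieldSpec:
    """Hypothetical stand-in for a header field description."""
    name: str
    fmt: str      # struct format, e.g. ">h" = big-endian 16-bit signed integer
    offset: int   # zero-based byte offset within the 400-byte binary header


BINARY_HEADER_FIELDS = [
    FieldSpec("sample_interval", ">h", 16),
    FieldSpec("samples_per_trace", ">h", 20),
    FieldSpec("data_sample_format", ">h", 24),
]


def parse_header(raw: bytes, fields: list[FieldSpec]) -> dict[str, int]:
    """Unpack each described field from the raw header bytes."""
    return {f.name: struct.unpack_from(f.fmt, raw, f.offset)[0] for f in fields}


# Synthetic 400-byte header: interval 4000, 1500 samples, format code 1.
raw = bytearray(400)
struct.pack_into(">h", raw, 16, 4000)
struct.pack_into(">h", raw, 20, 1500)
struct.pack_into(">h", raw, 24, 1)

parsed = parse_header(bytes(raw), BINARY_HEADER_FIELDS)
print(parsed)
```

Keeping the field list as data, separate from the parsing code, is what allows a descriptor to be swapped or extended without touching the reader.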

Extended Text Header

The extended_text_header is an optional attribute that provides space for information that does not fit in the regular text file header. It can hold additional human-readable metadata about the data.


Extended text headers were added in SEG-Y Revision 1.0.


Trace

The trace component is a descriptor for both the trace headers and the associated data. Trace headers contain information about each individual seismic trace in the dataset, and the trace data contains the actual numerical seismic samples.

The Customize Method

The customize method lets users tailor an existing SEG-Y descriptor to their requirements. Every argument is optional, and each one updates one part of the descriptor: the text header, binary header, extended text header, trace header, or trace data. Note that the SEG-Y standard is always set to custom when using this method.
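The behaviour described above can be sketched with a small stand-in class. ToyDescriptor is hypothetical and far simpler than SegyDescriptor. It only illustrates the pattern of overriding the supplied parts, carrying the rest over, and marking the standard as custom. Returning a copy rather than modifying in place is an assumption of this sketch.

```python
from dataclasses import dataclass, replace
from typing import Optional, Tuple, Union


@dataclass(frozen=True)
class ToyDescriptor:
    """Hypothetical stand-in for a descriptor; not the segy package's class."""
    segy_standard: Union[str, float]
    binary_header_fields: Tuple[str, ...]
    trace_header_fields: Tuple[str, ...]

    def customize(
        self,
        binary_header_fields: Optional[Tuple[str, ...]] = None,
        trace_header_fields: Optional[Tuple[str, ...]] = None,
    ) -> "ToyDescriptor":
        """Return a copy with the given parts replaced and the standard set to custom."""
        changes = {"segy_standard": "custom"}
        if binary_header_fields is not None:
            changes["binary_header_fields"] = binary_header_fields
        if trace_header_fields is not None:
            changes["trace_header_fields"] = trace_header_fields
        return replace(self, **changes)


rev1 = ToyDescriptor(1.0, ("sample_interval",), ("inline", "crossline"))
custom = rev1.customize(trace_header_fields=("inline", "crossline", "cdp_x"))

print(custom.segy_standard)          # the standard is now marked custom
print(custom.binary_header_fields)   # parts that were not supplied carry over
print(rev1.segy_standard)            # the original object is left unchanged
```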


pydantic model segy.schema.segy.SegyDescriptor

A descriptor class for a SEG-Y file.

field segyStandard: SegyStandard | None [Required]

SEG-Y Revision / Standard. Can also be custom.

field textFileHeader: TextHeaderDescriptor [Required]

Textual file header descriptor.

field binaryFileHeader: StructuredDataTypeDescriptor [Required]

Binary file header descriptor.

field extendedTextHeader: TextHeaderDescriptor | None = None

Extended textual header descriptor.

field trace: TraceDescriptor [Required]

Trace header + data descriptor.

field endianness: Endianness | None = None

Endianness of SEG-Y file.

customize(text_header_spec=None, binary_header_fields=None, extended_text_spec=None, trace_header_fields=None, trace_data_spec=None)

Customize an existing SEG-Y descriptor.


Returns:

A modified SEG-Y descriptor with “custom” SEG-Y standard.

Return type:

SegyDescriptor

pydantic model segy.schema.header.TextHeaderDescriptor

A descriptor class for SEG-Y textual headers.

field rows: int [Required]

Number of rows in text header.

field cols: int [Required]

Number of columns in text header.

field encoding: TextHeaderEncoding [Required]

String encoding.

field format: ScalarType [Required]

Type of string.

field offset: int | None = None

Starting byte offset.

Constraints:

  • ge = 0

property dtype: dtype[Any]

Get numpy dtype.

property itemsize: int

Number of bytes for the data type.

field description: str | None = None

Description of the field.

class segy.schema.segy.SegyStandard

Allowed values for SEG-Y standards in SegyDescriptor.

REV0 = 0.0
REV1 = 1.0
REV2 = 2.0
REV21 = 2.1

pydantic model segy.schema.segy.SegyInfo

Concise and useful information about SEG-Y files.

field uri: str [Required]

URI of the SEG-Y file.

field segyStandard: SegyStandard | None [Required]

SEG-Y Revision / Standard. Can also be custom.

field numTraces: int [Required]

Number of traces.

field samplesPerTrace: int [Required]

Trace length in number of samples.

field sampleInterval: int | float [Required]

Sample interval read from the binary file header.

field fileSize: int [Required]

File size in bytes.
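The SegyInfo fields are related to one another. For a file laid out like the Revision 0 diagram at the top of this page, the trace count follows from the file size and trace length. The sketch below assumes fixed-length traces, no extended textual headers, and 4-byte samples. The function count_traces is hypothetical, and how the library itself derives numTraces is not shown here.

```python
FILE_HEADER_BYTES = 3200 + 400   # textual + binary file headers
TRACE_HEADER_BYTES = 240


def count_traces(file_size: int, samples_per_trace: int, bytes_per_sample: int = 4) -> int:
    """Derive the trace count from the file size for fixed-length traces."""
    trace_bytes = TRACE_HEADER_BYTES + samples_per_trace * bytes_per_sample
    num_traces, leftover = divmod(file_size - FILE_HEADER_BYTES, trace_bytes)
    if file_size < FILE_HEADER_BYTES or leftover:
        raise ValueError("file size is inconsistent with the assumed layout")
    return num_traces


# A synthetic size: 100 traces of 1500 four-byte samples each.
size = FILE_HEADER_BYTES + 100 * (TRACE_HEADER_BYTES + 1500 * 4)
print(count_traces(size, samples_per_trace=1500))  # 100
```

A non-zero remainder usually means one of the assumptions does not hold, for example a different sample size or the presence of extended textual headers.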