e2e

package

v1.0.5 Latest Latest Go to latest Published: May 9, 2025 License: Apache-2.0 Imports: 23 Imported by: 0

Details

Valid go.mod file
Redistributable license
Tagged version
Stable version
Learn more about best practices

Repository

github.com/aws/eks-hybrid

Links

README ¶

E2E Testing Guide

This guide explains how to run end-to-end (E2E) tests locally and outlines the key principles of our E2E testing framework.

Core Principles

Idempotent Setup and Cleanup: The setup and cleanup commands are designed to be rerunnable without causing errors or duplicate resources.
Self-Cleaning Tests: Each test is responsible for cleaning up any resources it creates during execution.
Comprehensive Resource Management: When new infrastructure resources are added to tests, they must also be handled by the sweeper cleanup mechanism.
Strategic Retry Management:
- Infrastructure operations (AWS API calls, Kubernetes API operations) must implement appropriate retry mechanisms with exponential backoff
- Test verification steps should include retries with reasonable timeouts to handle eventual consistency
- nodeadm operations must NOT include retries in test code - any reliability improvements for nodeadm operations must be implemented within nodeadm itself to ensure accurate product behavior assessment

Running E2E Tests Locally

Using the run-e2e Subcommand

The run-e2e subcommand provides a streamlined way to run E2E tests with configurable options. Here's a basic example:

```
make e2e-test ginkgo
```

./_bin/e2e-test run-e2e -f="al23-amd64 && simpleflow && ssm" --skip-cleanup=true --artifacts-dir=e2e-artifacts --logs-bucket=<s3 logs bucket>

Passing --skip-cleanup allows for quickly reruning additional test cases without having to wait for new cluster creation/teardown.

After running run-e2e to create your test infrastructure, you could use the following when rerunning:

 ./_bin/ginkgo  -v -tags=e2e --label-filter='al23-amd64 && simpleflow && ssm'  ./test/e2e/suite/nodeadm -- -filepath=e2e-artifacts/configs/e2e-param.yaml

Key Options

-f, --test-filter: Filter tests using ginkgo label filters (e.g., "al23-amd64 && simpleflow && ssm")
--skip-cleanup: Skip cleanup after test completion
--artifacts-dir: Directory to store test artifacts (defaults to a temporary directory)
--cni: CNI plugin to use (default: "cilium")
--logs-bucket: S3 bucket for uploading test logs
--no-color: Disable colored output
-n, --name: Cluster name (default: "nodeadm-e2e-tests-1-31")
-r, --region: AWS region (default: "us-west-2")
-k, --kubernetes-version: Kubernetes version (default: "1.31")
-p, --procs: Number of processes to run (default: 1)
--timeout: Test timeout (default: "60m")
--setup-config: Path to YAML file containing cluster.TestResources configuration
--test-config: Path to YAML file containing suite.TestConfig configuration
--ginkgo-binary: Path to the ginkgo binary (defaults to binary in same folder as e2e-test or in PATH)

You can run the tests manually using the CLI commands:

make:
```
make e2e-test ginkgo
```

Setup the test infrastructure:

./_bin/e2e-test setup -s path/to/e2e-config.yaml

Run a single test:

 ./_bin/ginkgo  -v -tags=e2e --label-filter='al23-amd64 && simpleflow && ssm'  ./test/e2e/suite/nodeadm -- -filepath=path/to/e2e-param.yaml

Clean up the infrastructure:

# Using the resources file (for specific cleanup)
./_bin/e2e-test cleanup -f path/to/resources.yaml

Configuration Files

Cluster Resources Configuration (e2e-config.yaml)

This file defines the infrastructure to be created for testing:

clusterName: nodeadm-e2e-tests-1-31
clusterRegion: us-west-2
clusterNetwork:
  vpcCidr: 10.0.0.0/16
  publicSubnetCidr: 10.0.10.0/24
  privateSubnetCidr: 10.0.20.0/24
hybridNetwork:
  vpcCidr: 10.1.0.0/16
  publicSubnetCidr: 10.1.1.0/24
  privateSubnetCidr: 10.1.2.0/24
  podCidr: 10.2.0.0/16
kubernetesVersion: "1.31"
cni: cilium
endpoint: ""

cni - cilium or calico
endpoint - optional, intended to be used for testing against beta or other environments

Test Parameters Configuration (e2e-param.yaml)

This file configures the test execution parameters:

clusterName: nodeadm-e2e-tests-1-31
clusterRegion: us-west-2
nodeadmUrlAMD: https://hybrid-assets.eks.amazonaws.com/releases/latest/bin/linux/amd64/nodeadm
nodeadmUrlARM: https://hybrid-assets.eks.amazonaws.com/releases/latest/bin/linux/arm64/nodeadm
setRootPassword: false
logsBucket: ""
endpoint: ""
artifactsFolder: ""

setRootPassword: optional, if true, newly created EC2 instances will have a randomly set root password for logging into
logsBucket: optional, if set, test will collect logs bundle and upload to bucket
endpoint - optional, intended to be used for testing against beta or other environments
artifactsFolder: optional, if set, tests boot logs/junit/json ginkgo output will be written to, otherwise a tmp folder is used
Note: the above files could be combined into one and the folder e2e-config is in the gitignore and is a good place to store these files.

Cleanup Options

Using the Cleanup Command

The cleanup command requires a resources file and removes specific resources:

./_bin/e2e-test cleanup --filename path/to/resources.yaml

Using the Sweeper Command

The sweeper command provides more flexible cleanup options:

# Clean up a specific cluster
./_bin/e2e-test sweeper --cluster-name nodeadm-e2e-tests-1-31

# Clean up clusters with a specific prefix
./_bin/e2e-test sweeper --cluster-prefix nodeadm-e2e-tests- --age-threshold 12h

# Clean up all test clusters older than 24 hours
./_bin/e2e-test sweeper --all --age-threshold 24h

# Dry run to see what would be deleted without making changes
./_bin/e2e-test sweeper --cluster-prefix nodeadm-e2e-tests- --dry-run

Managing Test Nodes

Creating Test Nodes

The create subcommand allows you to create individual test nodes for your E2E test cluster. This is useful for testing specific node configurations or when you need to add nodes to an existing cluster.

Basic usage:

./_bin/e2e-test create <INSTANCE_NAME> -f path/to/e2e-param.yaml

Key options:

-f, --config-file: Path to the test configuration file (required)
-c, --creds-provider: Credentials provider to use (iam-ra, ssm)
-o, --os: Operating system to use (al23, ubuntu2004, ubuntu2204, ubuntu2404, rhel8, rhel9)
-a, --arch: Architecture to use (amd64, arm64)
-w, --wait-for-ready: Wait for the node to be ready in the cluster

Example:

# Create an Amazon Linux 2023 AMD64 node with SSM credentials
./_bin/e2e-test create test-node-1 -f e2e-artifacts/configs/e2e-param.yaml -c ssm -o al23 -a amd64 -w

# Create an Ubuntu 22.04 ARM64 node with IAM Roles Anywhere credentials
./_bin/e2e-test create test-node-2 -f e2e-artifacts/configs/e2e-param.yaml -c iam-ra -o ubuntu2204 -a arm64

Note: For RHEL nodes, you need to set the following environment variables:

RHEL_USERNAME: Your Red Hat subscription username
RHEL_PASSWORD: Your Red Hat subscription password

SSH into Test Nodes

The ssh subcommand allows you to connect to test nodes through the jumpbox instance. This is useful for debugging or manual testing.

Basic usage:

./_bin/e2e-test ssh <INSTANCE_ID>

Example:

# SSH into a node using its instance ID
./_bin/e2e-test ssh i-0123456789abcdef0

The command will:

Find the instance in AWS
Identify the cluster it belongs to
Locate the jumpbox instance
Establish an SSH connection through SSM

Note: The SSH connection is established through AWS Systems Manager Session Manager, so you don't need to manage SSH keys or security groups directly.

Documentation ¶

Index ¶

func NewAWSConfig(ctx context.Context, optFns ...func(*config.LoadOptions) error) (aws.Config, error)
func NewEKSClient(aws aws.Config, endpoint string) *eks.Client
func NewLogger(opts ...LoggerOption) logr.Logger
func SanitizeForAWSName(input string) string
func Truncate(name string, limit int) string
type Cluster
type CredsOS
type EksResolverV2
- func (r *EksResolverV2) ResolveEndpoint(ctx context.Context, params eks.EndpointParameters) (smithyendpoints.Endpoint, error)
type File
type HybridEC2Node
type InstanceSize
type LoggerConfig
- func (c LoggerConfig) Apply(opts *LoggerConfig)
type LoggerOption
- func WithOutputFile(filename string) LoggerOption
- func WithWriter(w io.Writer) LoggerOption
type LogsUploadUrl
type NodeSpec
type NodeadmCredentialsProvider
type NodeadmOS
type NodeadmURLs
type PausableLogger
- func NewPausableLogger(opts ...LoggerOption) PausableLogger
- func (l PausableLogger) Pause()
- func (l PausableLogger) Resume() error
type SwitchWriter
- func NewSwitchWriter(active io.Writer) *SwitchWriter
- func (s *SwitchWriter) Pause()
- func (s *SwitchWriter) Resume() error
- func (s *SwitchWriter) Write(p []byte) (int, error)
type TestConfig
- func ReadConfig(configPath string) (*TestConfig, error)
type UserDataInput

Constants ¶

This section is empty.

Variables ¶

This section is empty.

Functions ¶

func NewAWSConfig ¶ added in v1.0.4

func NewAWSConfig(ctx context.Context, optFns ...func(*config.LoadOptions) error) (aws.Config, error)

func NewEKSClient ¶ added in v1.0.4

func NewEKSClient(aws aws.Config, endpoint string) *eks.Client

func NewLogger ¶

func NewLogger(opts ...LoggerOption) logr.Logger

func SanitizeForAWSName ¶ added in v1.0.1

func SanitizeForAWSName(input string) string

SanitizeForAWSName removes everything except alphanumeric characters and hyphens from a string.

func Truncate ¶ added in v1.0.1

func Truncate(name string, limit int) string

Truncate drops characters from the end of a string if it exceeds the limit.

Types ¶

type Cluster ¶

type Cluster struct {
	Name   string
	Region string
}

type CredsOS ¶ added in v1.0.1

type CredsOS interface {
	Name() string
}

CredsOS is the Node OS.

type EksResolverV2 ¶ added in v1.0.2

type EksResolverV2 struct {
	Endpoint string
}

EksResolverV2 is used to resolve custom endpoints for EKS clients.

func (*EksResolverV2) ResolveEndpoint ¶ added in v1.0.2

func (r *EksResolverV2) ResolveEndpoint(ctx context.Context, params eks.EndpointParameters) (
	smithyendpoints.Endpoint, error,
)

ResolveEndpoint resolves to a custom endpoint if not empty or default otherwise.

type File ¶ added in v1.0.1

type File struct {
	Content     string
	Path        string
	Permissions string
}

File represents a file in disk.

type HybridEC2Node ¶ added in v1.0.1

type HybridEC2Node struct {
	Node corev1.Node
}

HybridEC2Node represents a Hybrid Node backed by an EC2 instance.

type InstanceSize ¶ added in v1.0.5

type InstanceSize int

const (
	Large InstanceSize = iota
	XLarge
)

type LoggerConfig ¶ added in v1.0.2

type LoggerConfig struct {
	NoColor bool
	// contains filtered or unexported fields
}

func (LoggerConfig) Apply ¶ added in v1.0.2

func (c LoggerConfig) Apply(opts *LoggerConfig)

type LoggerOption ¶ added in v1.0.2

type LoggerOption interface {
	Apply(*LoggerConfig)
}

func WithOutputFile ¶ added in v1.0.4

func WithOutputFile(filename string) LoggerOption

WithOutputFile returns a LoggerOption that configures the logger to write to both the specified file and stdout.

func WithWriter ¶ added in v1.0.5

func WithWriter(w io.Writer) LoggerOption

WithWriter returns a LoggerOption that sets the output writer to the given io.Writer.

type LogsUploadUrl ¶ added in v1.0.1

type LogsUploadUrl struct {
	Name string
	Url  string
}

type NodeSpec ¶ added in v1.0.1

type NodeSpec struct {
	Cluster  *Cluster
	Name     string
	OS       CredsOS
	Provider NodeadmCredentialsProvider
}

NodeSpec is a specification for a node.

type NodeadmCredentialsProvider ¶ added in v1.0.1

type NodeadmCredentialsProvider interface {
	Name() creds.CredentialProvider
	NodeadmConfig(ctx context.Context, node NodeSpec) (*api.NodeConfig, error)
	VerifyUninstall(ctx context.Context, instanceId string) error
	FilesForNode(spec NodeSpec) ([]File, error)
}

type NodeadmOS ¶ added in v1.0.1

type NodeadmOS interface {
	Name() string
	AMIName(ctx context.Context, awsConfig aws.Config) (string, error)
	BuildUserData(userDataInput UserDataInput) ([]byte, error)
	InstanceType(region string, instanceSize InstanceSize) string
}

NodeadmOS defines an interface for operating system-specific behavior.

type NodeadmURLs ¶ added in v1.0.1

type NodeadmURLs struct {
	AMD string
	ARM string
}

type PausableLogger ¶ added in v1.0.3

type PausableLogger struct {
	logr.Logger
	// contains filtered or unexported fields
}

PausableLogger can be paused and resumed. It wraps a logr.Logger. When paused, writes go to a buffer; when resumed, writes go to stdout. After it's resumed, the buffer is flushed to stdout.

func NewPausableLogger ¶ added in v1.0.3

func NewPausableLogger(opts ...LoggerOption) PausableLogger

NewPausableLogger returns a logger that can be paused and resumed.

func (PausableLogger) Pause ¶ added in v1.0.3

func (l PausableLogger) Pause()

func (PausableLogger) Resume ¶ added in v1.0.3

func (l PausableLogger) Resume() error

type SwitchWriter ¶ added in v1.0.3

type SwitchWriter struct {
	// contains filtered or unexported fields
}

SwitchWriter implements io.Writer. When paused, writes go to a buffer; when resumed, writes go to the active writer.

func NewSwitchWriter ¶ added in v1.0.3

func NewSwitchWriter(active io.Writer) *SwitchWriter

func (*SwitchWriter) Pause ¶ added in v1.0.3

func (s *SwitchWriter) Pause()

Pause causes subsequent writes to be buffered.

func (*SwitchWriter) Resume ¶ added in v1.0.3

func (s *SwitchWriter) Resume() error

Resume flushes the buffer to the active writer and resumes normal output.

func (*SwitchWriter) Write ¶ added in v1.0.3

func (s *SwitchWriter) Write(p []byte) (int, error)

type TestConfig ¶ added in v1.0.4

type TestConfig struct {
	ClusterName     string `yaml:"clusterName"`
	ClusterRegion   string `yaml:"clusterRegion"`
	NodeadmUrlAMD   string `yaml:"nodeadmUrlAMD"`
	NodeadmUrlARM   string `yaml:"nodeadmUrlARM"`
	SetRootPassword bool   `yaml:"setRootPassword"`
	NodeK8sVersion  string `yaml:"nodeK8SVersion"`
	LogsBucket      string `yaml:"logsBucket"`
	Endpoint        string `yaml:"endpoint"`
	// ArtifactsFolder is the local path where the test will store the artifacts.
	ArtifactsFolder string `yaml:"artifactsFolder"`
}

func ReadConfig ¶ added in v1.0.4

func ReadConfig(configPath string) (*TestConfig, error)

ReadConfig reads the configuration from the specified file path and unmarshals it into the TestConfig struct.

type UserDataInput ¶ added in v1.0.1

type UserDataInput struct {
	CredsProviderName string
	EKSEndpoint       string
	KubernetesVersion string
	NodeadmUrls       NodeadmURLs
	NodeadmConfigYaml string
	Provider          string
	PublicKey         string
	Region            string
	RootPasswordHash  string
	Files             []File
}

Source Files ¶

View all Source files

Directories ¶

Path	Synopsis
addon
cfn
cleanup
cluster
cni
commands
constants
credentials
ec2
errors
kubernetes
nodeadm
os
peered
run
s3
ssh
ssm
suite

?	: This menu
/	: Search site
f or F	: Jump to
y or Y	: Canonical URL