graph TD
    A[Developer writes and commits code] --> B[Push code to GitHub]
    B --> C[GitHub triggers Jenkins via Webhook]
    C --> D[Clone repository to Jenkins workspace]
    D --> E[Run CI Pipeline]

    subgraph CI
        E1[Build Code]
        E2[Run Unit Tests]
        E3[Static Code Analysis]
        E4[Package Build Artifacts]
    end

    E --> E1 --> E2 --> E3 --> E4 --> F[Store Build Artifacts in Artifact Repository]
    F --> G[Deploy to Staging Environment]

    subgraph CD
        G1[Run Smoke Tests]
        G2[Deploy to Production]
        G3[Run Acceptance Tests]
    end

    G --> G1 --> G2 --> G3 --> H[Notify Team of Deployment Status]

    H --> I[Monitor Application Performance]
    I --> J[Feedback to Development Team]

| Category | Command | Sub-command | Description |
| --- | --- | --- | --- |
| File Operations and Text Processing | cat | `cat filename` | Displays the contents of filename. |
| | | `cat file1 file2 > combined_file` | Concatenates file1 and file2 into combined_file. |
| | grep | `grep "pattern" file` | Searches for "pattern" in file. |
| | | `grep -r "pattern" directory/` | Recursively searches for "pattern" in directory and its subdirectories. |
| | | `grep -i "pattern" file` | Searches for "pattern" in file case-insensitively. |
| | sed | `sed 's/old/new/g' file` | Replaces all occurrences of "old" with "new" in file. |
| | | `sed -i '1d' file` | Deletes the first line of file in place. |
| | awk | `awk -F"\t" '{print $8}' file` | Prints the 8th column of a tab-separated file. |
| | | `awk '{sum += $1} END {print sum}' file` | Sums the first column of file and prints the total. |
| | sort | `sort file` | Sorts the lines in file alphabetically. |
| | | `sort -t $'\t' -k 2 file` | Sorts file by the second tab-separated column. |
| | | `sort -t $'\t' -rk 2 file` | Sorts file in reverse order by the second tab-separated column. |
| | uniq | `uniq file` | Removes adjacent duplicate lines from file (typically run after sort). |
| | | `uniq -c file` | Counts the number of occurrences of each adjacent line in file. |
| | split | `split -l 10000 file.txt` | Splits file.txt into smaller files of 10,000 lines each. |
| | wc | `wc -l file` | Counts the number of lines in file. |
| | | `wc -w file` | Counts the number of words in file. |
| | join | `join file1 file2` | Joins lines of file1 and file2 on a common field (both files must be sorted on that field). |
| | head | `head -n 100 filename` | Displays the first 100 lines of filename. |
| | tail | `tail -n +1000 filename` | Displays filename starting from line 1000. |
| Data Statistics and Analysis | xargs | `ls \| xargs grep "pattern"` | Runs grep "pattern" on each file listed by ls. |
| | tee | `command \| tee file` | Writes the output of command to file while also printing it to stdout. |
| | | `command \| tee -a file` | Appends the output of command to file while also printing it. |
| Compression and Archiving | gzip | `gzip filename` | Compresses filename using the gzip algorithm. |
| | | `gzip -d filename.gz` | Decompresses filename.gz. |
| | tar | `tar -cvf archive.tar /path/to/directory` | Creates a tar archive without compression. |
| | | `tar -zcvf archive.tar.gz /path/to/directory` | Creates a tar archive with gzip compression. |
| | | `tar -xf archive.tar -C /destination` | Extracts a tar archive to the specified destination. |
| | | `tar -xzf archive.tar.gz -C /destination` | Extracts a gzip-compressed tar archive to the specified destination. |
| | | `tar -cvf archive.tar /path --exclude='*log*' --exclude='*data*'` | Creates a tar archive while excluding files matching the patterns. |
| | zip | `zip archive.zip file1 file2` | Compresses file1 and file2 into archive.zip. |
| | | `zip -r archive.zip directory/` | Recursively compresses directory into archive.zip. |
| Data Flow and Process Management | ps | `ps aux --sort=-%mem` | Lists processes sorted by memory usage (highest first). |
| | | `ps -ef` | Displays all running processes. |
| | | `ps -eaf` | Another variant that displays all processes. |
| | top | `top` | Displays real-time system processes and resource usage. |
| | kill | `kill -9 $pid` | Forcefully terminates the process with the specified PID. |
| | pgrep | `pgrep process_name` | Searches for processes by name and returns their PIDs. |
| | bg | `bg %job` | Resumes a suspended job in the background. |
| | jobs | `jobs` | Lists active jobs in the current shell. |
| | nohup | `nohup command > output.log 2>&1 &` | Runs command immune to hangups, redirecting output to output.log and running in the background. |
| Network and File Transfer | wget | `wget http://example.com/file.zip` | Retrieves file.zip from the specified URL. |
| | | `wget -O output.txt http://example.com/data` | Downloads data from the specified URL and saves it as output.txt. |
| | scp | `scp file.txt user@remote:/path/` | Securely copies file.txt to a remote host. |
| | | `scp -r /local/dir user@remote:/path/` | Securely copies a directory recursively to a remote host. |
| | netstat | `netstat -tunpl \| grep [port]` | Lists listening ports and associated processes. |
| | | `netstat -nap \| grep [pid]` | Shows network connections for a specific PID. |
| | nc (netcat) | `nc -zv host port` | Scans host on port to check whether it is open. |
| | | `nc host port` | Connects to host on port for data transfer or communication. |
| System Information and Monitoring | df | `df -h` | Reports file system disk space usage in a human-readable format. |
| | | `df -T` | Shows the type of each file system. |
| | du | `du -h --max-depth=1` | Displays disk usage in a human-readable format, limited to one directory level. |
| | | `du -sh test_dir` | Shows the total disk usage of test_dir. |
| | iostat | `iostat` | Reports CPU and I/O statistics for devices and partitions. |
| File Search | find | `find . -name "*.log"` | Searches for all .log files in the current directory and subdirectories. |
| | | `find /path -type f -size +100M` | Finds files larger than 100 MB in /path. |
| | which | `which gcc` | Locates the executable path for gcc. |
| Permission Management | chmod | `chmod u+r file` | Adds read permission for the owning user on file. |
| | | `chmod o-r file` | Removes read permission for others on file. |
| | | `chmod 755 script.sh` | Sets permissions to rwxr-xr-x on script.sh. |
| | chown | `chown user:group file` | Changes ownership of file to user and group. |
| | | `chown -R user:group directory/` | Recursively changes ownership of directory and its contents to user and group. |
| | ls -l | `ls -l` | Lists directory contents in long format, showing permissions and ownership. |
| Other Tools | env | `env` | Displays the current environment variables. |
| | | `env VAR=value command` | Sets the environment variable VAR to value for the duration of command. |
| | date | `date` | Displays the current date and time. |
| | | `date +"%Y-%m-%d"` | Outputs the date in YYYY-MM-DD format. |
| | watch | `watch -n 1 ls` | Executes ls every second, updating the display. |
| | alias | `alias ll='ls -al'` | Creates an alias ll for ls -al. |
| | | `alias gs='git status'` | Creates an alias gs for git status. |
| Advanced Tools | jq | `jq '.' file.json` | Parses and pretty-prints JSON data from file.json. |
| | | `jq '.key' file.json` | Extracts the value of key from file.json. |
| Network Configuration and Management | netplan | `netplan apply` | Applies the network configuration defined in Netplan YAML files. |
| | ip | `ip addr add 10.240.224.117/24 dev ens9f0` | Adds an IP address to the network interface ens9f0. |
| | | `ip route add default via 10.240.224.1` | Adds a default gateway route via 10.240.224.1. |
| | | `ip a sh dev ens1f0` | Shows the address information for the device ens1f0. |
| | | `ip l s ens1f0 up` | Sets the link state of ens1f0 to up. |
| | ifconfig | `ifconfig ens9f0 up` | Brings up the network interface ens9f0. |
| | | `ifconfig ens9f0` | Displays the configuration of the network interface ens9f0. |
| Networking Utilities | nslookup | `nslookup child-prc.intel.com` | Queries DNS to obtain domain name information for child-prc.intel.com. |
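
As a quick illustration of how these primitives compose in a pipeline, here is a small, hedged example (the file name app.log and the "ERROR" pattern are made up for illustration) that prints the ten most frequent ERROR lines in a log:

```sh
# grep filters, sort groups identical lines, uniq -c counts them,
# sort -rn ranks by count, head keeps the top ten.
grep "ERROR" app.log | sort | uniq -c | sort -rn | head -n 10
```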

1. Introduction to Time Series Data

TL;DR

Time series data is a sequence of data points collected or recorded at specific time intervals, typically used to track changes or trends over time.

Data Schema

  • Structure: identifier -> (t0, v0), (t1, v1), (t2, v2), ...

Data in Prometheus

  • Format: <metric_name>{<label_name>=<label_value>, ...}

Example of a typical set of series identifiers (data model):

```
{ "__name__": "http_requests_total", "pod": "example-pod", "job": "example-job", "path": "/api/v1/resource", "status": "200", "method": "GET"}   @1430000000   94355
{ "__name__": "http_requests_total", "pod": "example-pod", "job": "example-job", "path": "/api/v1/resource", "status": "200", "method": "PUT"}   @1435000000   94355
{ "__name__": "http_requests_total", "pod": "example-pod", "job": "example-job", "path": "/api/v1/resource", "status": "200", "method": "POST"}  @1439999999   94355
```

Components:

  • Key: Series
    • Metric Name: __name__
    • Labels:

      {"pod": "example-pod", "job": "example-job", "path": "/api/v1/resource", "status": "200", "method": "GET"}

    • Timestamp: Recorded time of the sample
  • Value: Sample value

How to Query:

  • Example Queries:
    • `__name__="http_requests_total"` - Selects all series belonging to the http_requests_total metric (more commonly written as just `http_requests_total`)
    • `method=~"PUT|POST"` - Selects all series whose method is either PUT or POST; note the regex matcher `=~`, since a plain `=` would only match the literal string "PUT|POST"
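
Putting both matchers together gives a complete PromQL selector (metric and label names taken from the example series above):

```
http_requests_total{method=~"PUT|POST", status="200"}
```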

High Availability (HA) and Reliability are two important concepts in system design, but they address different aspects of system performance and robustness. Below, I'll provide code examples and explanations to illustrate the differences between HA and Reliability.

High Availability (HA)

High Availability focuses on ensuring that a system is operational and accessible for as much time as possible. This often involves redundancy and failover mechanisms to minimize downtime.

Example: High Availability with Load Balancer and Multiple Instances

```python
# Example using Flask and Gunicorn for a web application
# app.py
from flask import Flask

app = Flask(__name__)

@app.route('/')
def hello():
    return "Hello, World!"

if __name__ == '__main__':
    app.run(host='0.0.0.0', port=5000)
```
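
On its own, this app is a single point of failure. To match the "load balancer and multiple instances" pattern above, you would typically run several copies, for example `gunicorn -w 4 -b 0.0.0.0:5000 app:app` on two or more hosts (`app:app` assumes the file above is saved as app.py), and place them behind a load balancer such as Nginx or HAProxy so the failure of any one instance does not take the service down.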

In Kubernetes, the term "reconcile" is used to describe the process by which an operator controller ensures that the current state of a resource matches the desired state specified by the user.

The name "reconcile" is derived from the concept of reconciliation, which means to make consistent or congruent.

  1. Desired State vs. Current State:
    • Kubernetes operates on a declarative model where users specify the desired state of the system using YAML or JSON manifests.
    • The actual state of the system is the current state of the resources as observed in the cluster.
  2. Reconciliation Loop:
    • The core responsibility of a Kubernetes controller (including operators) is to continuously monitor the current state of resources and compare it with the desired state.
    • If there is a discrepancy between the desired state and the current state, the controller takes actions to bring the current state in line with the desired state. This process is known as reconciliation.
  3. Reconcile Function:
    • The "reconcile" function is the heart of this process. It is called whenever there is a change in the resource or periodically to ensure the desired state is maintained.
    • The function typically involves reading the current state of the resource, comparing it with the desired state, and then performing the necessary operations (such as creating, updating, or deleting resources) to reconcile the two states.
  4. Idempotency:
    • The reconcile function is designed to be idempotent, meaning that running it multiple times with the same input should produce the same result. This ensures that the system remains stable and consistent even if the function is triggered multiple times.
  5. Event-Driven:
    • The reconciliation process is often event-driven. When a resource changes (e.g., a new pod is created, or a deployment is updated), an event is generated, and the reconcile function is triggered to handle the change.

In summary, the name "reconcile" aptly describes the function's role in ensuring that the actual state of the system matches the desired state as defined by the user. It reflects the continuous and iterative nature of the process, where the controller works to "reconcile" any differences between the two states.
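
To ground this, here is a minimal sketch of a reconcile function written with controller-runtime, the de facto operator framework. The Widget CRD, its examplev1 package, and the spec/status fields are hypothetical placeholders, not a real API:

```go
package controllers

import (
	"context"

	ctrl "sigs.k8s.io/controller-runtime"
	"sigs.k8s.io/controller-runtime/pkg/client"

	examplev1 "example.com/widget-operator/api/v1" // hypothetical CRD package
)

type WidgetReconciler struct {
	client.Client
}

func (r *WidgetReconciler) Reconcile(ctx context.Context, req ctrl.Request) (ctrl.Result, error) {
	// 1. Observe: read the current state of the resource named in the request.
	var widget examplev1.Widget
	if err := r.Get(ctx, req.NamespacedName, &widget); err != nil {
		// Deleted between the event and this call: nothing left to reconcile.
		return ctrl.Result{}, client.IgnoreNotFound(err)
	}

	// 2. Diff and act: compare desired state (spec) with observed state
	//    (status) and perform the operations that close the gap. The body is
	//    idempotent: re-running it with the same input converges to the same
	//    cluster state.
	if widget.Status.ReadyReplicas != widget.Spec.Replicas {
		// ...create/update/delete owned resources here...
		widget.Status.ReadyReplicas = widget.Spec.Replicas
		if err := r.Status().Update(ctx, &widget); err != nil {
			return ctrl.Result{}, err
		}
	}

	// 3. Done: an empty Result with a nil error marks the request handled;
	//    periodic re-checks can be requested with Result{RequeueAfter: ...}.
	return ctrl.Result{}, nil
}
```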

graph TD
    A[User Request] -->|kubectl| B[API Server]
    B --> C[etcd]
    B --> D[Controller Manager]
    D -->|Reconcile Loop| E[Custom Controller]
    E -->|Check Desired State| F[etcd]
    E -->|Check Current State| G[API Server]
    E -->|Update Resources| H[Scheduler]
    H --> I[Nodes]
    I -->|Run Pods| J[Actual State]
    J -->|Report Status| G
    G -->|Update Status| F
    F -->|Store State| C

graph TB
    subgraph "Kafka Cluster"
        Broker1["Broker 1"]
        Broker2["Broker 2"]
        Broker3["Broker 3"]
    end

    subgraph "Topic: my-topic (3 Partitions)"
        P0["Partition 0"]
        P1["Partition 1"]
        P2["Partition 2"]
    end

    P0 --> LeaderP0["Leader (Broker 1)"]
    P0 --> FollowerP0_B2["Follower (Broker 2)"]
    P0 --> FollowerP0_B3["Follower (Broker 3)"]

    P1 --> LeaderP1["Leader (Broker 2)"]
    P1 --> FollowerP1_B3["Follower (Broker 3)"]
    P1 --> FollowerP1_B1["Follower (Broker 1)"]

    P2 --> LeaderP2["Leader (Broker 3)"]
    P2 --> FollowerP2_B1["Follower (Broker 1)"]
    P2 --> FollowerP2_B2["Follower (Broker 2)"]

    Producer["Producer"] -->|Write to Leader| LeaderP0
    Producer -->|Write to Leader| LeaderP1
    Producer -->|Write to Leader| LeaderP2

    ConsumerGroup1["Consumer Group 1"] -->|Consume from Partition 0| LeaderP0
    ConsumerGroup1 -->|Consume from Partition 1| LeaderP1
    ConsumerGroup1 -->|Consume from Partition 2| LeaderP2

    ConsumerGroup2["Consumer Group 2"] -->|Consume from Partition 0| LeaderP0
    ConsumerGroup2 -->|Consume from Partition 1| LeaderP1
    ConsumerGroup2 -->|Consume from Partition 2| LeaderP2

    Zookeeper["ZooKeeper / KRaft"] -->|Manage Metadata & Leader Election| Broker1
    Zookeeper --> Broker2
    Zookeeper --> Broker3

How does an external (outside-cluster) network request reach a Kubernetes operator, and how does the operator finally handle it? Walk through the process at a low level: TCP/IP, the Kubernetes Service mechanism, CRDs, the operator's reconcile loop, and the manager and controller inside the operator.

flowchart TD
  A[External Request] --> B[DNS Resolution]
  B --> C[TCP/IP Connection Established]
  C --> D[Load Balancer / Ingress Controller]

  D --> E[Kubernetes Service]
  E --> F[Forward Request to Operator Pod]

  subgraph Operator Pod Components
    F --> G[Operator Manager]
    G --> H[Controller Watches CRD Changes]
    H --> I[Reconciliation Loop]
    I --> J[Current State Assessment]
    J --> K[Compute Difference]
    K --> L[Execute Changes to Reach Desired State]
  end

  I --> Z[Sync State with CRD]
  L --> Z

To understand how an external network request reaches a Kubernetes (k8s) operator and how the operator processes it, we need to dissect the journey step by step, focusing on the underlying mechanisms like TCP/IP, Kubernetes services, Custom Resource Definitions (CRDs), and the internal workings of an operator, including the reconcile loop, managers, and controllers.
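
In code, the "manager" and "controller" pieces of that journey are wired together roughly as follows. This is a controller-runtime sketch that reuses the hypothetical Widget type and WidgetReconciler from the reconcile example above (assume both are in scope):

```go
package main

import (
	"os"

	ctrl "sigs.k8s.io/controller-runtime"

	examplev1 "example.com/widget-operator/api/v1" // hypothetical CRD package
)

func main() {
	// The manager owns the shared cache, clients, and leader election; it is
	// the process that the Kubernetes Service ultimately routes traffic to.
	mgr, err := ctrl.NewManager(ctrl.GetConfigOrDie(), ctrl.Options{})
	if err != nil {
		os.Exit(1)
	}

	// The controller watches Widget objects via the API server; every
	// create/update/delete event becomes a request queued for Reconcile.
	if err := ctrl.NewControllerManagedBy(mgr).
		For(&examplev1.Widget{}).
		Complete(&WidgetReconciler{Client: mgr.GetClient()}); err != nil {
		os.Exit(1)
	}

	// Start blocks, running the watch/queue/reconcile machinery until the
	// process receives a termination signal.
	if err := mgr.Start(ctrl.SetupSignalHandler()); err != nil {
		os.Exit(1)
	}
}
```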


1. Use Cold/Hot Storage Separation

Approach:

  • Hot Data (logs from the last few days or weeks): Stored in fast storage (like SSDs) for frequent access.
  • Cold Data (historical logs): Stored in object storage (like MinIO, S3) for long-term archiving and infrequent access.

Example Loki Configuration:

```yaml
storage_config:
  boltdb_shipper:
    active_index_directory: /var/loki/index  # Hot data directory
    shared_store: s3                         # Use MinIO/S3 as cold data storage
    cache_location: /var/loki/cache          # Cache directory

  aws:
    s3: http://minio-service.minio.svc.cluster.local:9000  # Address of MinIO
    bucketnames: loki-logs
    access_key_id: minio
    secret_access_key: minio123
```

Optimization Effect:

  • Reduces local storage pressure by moving historical logs to object storage.
  • Improves query performance: prioritizes hot data queries, with slightly higher latency for cold data queries but at lower costs.

Monitoring the Loki Logging System

In a high-concurrency environment where Loki handles intensive log processing and storage, monitoring metrics are essential for ensuring system stability, optimizing performance, and dynamically adjusting configuration (such as replica counts and sharding). Here are the main Loki component metrics you can monitor to adjust system settings as needed:

1. Loki Ingester Metrics

  • loki_ingester_memory_usage_bytes:

    Monitors memory usage for each Ingester instance. If memory usage remains close to system limits, consider increasing the number of Ingester replicas to distribute the load.

    • Purpose: Dynamically scale up or down the number of Ingester replicas based on memory usage.
  • loki_ingester_wal_fsync_duration_seconds:

    Monitors the time each Ingester instance takes to write WAL (Write-Ahead Log) data to disk. High write durations may indicate that write throughput is nearing its limit; consider expanding sharding or increasing storage bandwidth.

    • Purpose: Use WAL write latency to determine if scaling or storage optimization is needed.
  • loki_ingester_chunk_store_writes_total and loki_ingester_chunk_store_reads_total:

    Monitors the total number of chunk reads and writes. If write volume spikes, consider expanding the storage layer by adding more storage nodes to improve write performance.

    • Purpose: Assess whether to increase storage capacity or optimize storage performance.

2. Loki Distributor Metrics

  • loki_distributor_received_bytes_total:

    Monitors the total volume of log data received by the Distributor. If data volume significantly increases, consider adjusting the sharding strategy or adding more Distributor instances.

    • Purpose: Adjust sharding strategy based on log traffic and dynamically manage sharding to distribute log data.
  • loki_distributor_ring_members:

    Monitors the number of Ingesters actively handling log traffic in Loki's sharding model. If the number of active members is lower than expected (e.g., some Ingester nodes have crashed), consider increasing the number of Ingester replicas.

    • Purpose: Scale up or down the number of Ingester replicas based on the number of active Ingesters.
  • loki_distributor_accepted_requests_total and loki_distributor_rejected_requests_total:

    Monitors the number of accepted and rejected requests. Rejected requests may indicate that the system is overloaded, and additional capacity may be necessary.

    • Purpose: Adjust replicas and load distribution based on the count of rejected requests and system load.

3. Loki Querier Metrics

  • loki_querier_request_duration_seconds:

    Monitors query response times. If query response times increase, it may indicate high query load, in which case you may need to scale up the number of Querier instances.

    • Purpose: Dynamically add Querier instances to handle more query requests and reduce response times.
  • loki_querier_requests_total:

    Monitors the total number of query requests. If the query volume becomes too high, it could slow down the system, so consider increasing the number of Querier replicas.

    • Purpose: Scale Querier instances up or down based on query volume to improve response speed.

4. Storage Metrics

  • loki_chunk_store_writes_total and loki_chunk_store_read_duration_seconds: Monitors chunk data read/write time and frequency in storage. High write frequency or increased read time may indicate a storage performance bottleneck, necessitating either additional storage capacity or optimized storage strategies.

    • Purpose: Adjust storage configuration and add storage nodes to minimize storage bottlenecks affecting query or write performance.

5. System-Level Resource Monitoring

  • CPU and Memory Usage: Using Prometheus or Kubernetes' native monitoring tools (like HPA or VPA), monitor CPU and memory usage for Loki components (e.g., Ingester, Distributor, Querier). If resource usage for any component approaches its limits, consider horizontally scaling the number of replicas for that component.

    • Purpose: Dynamically adjust replicas based on CPU and memory usage.

6. High Availability Monitoring

  • loki_ring_members: Monitors the number of nodes in Loki's sharding ring, ensuring all nodes in the cluster are active. If node count decreases, consider rebalancing the shards or adding more instances to compensate for lost nodes.

    • Purpose: Dynamically adjust high-availability configurations based on ring member count.
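
Beyond dashboards, these metrics can drive alerting. As a hedged sketch using the ingester memory metric described above (the 8 GB threshold, group, and alert names are illustrative assumptions, not recommended values), a Prometheus alerting rule might look like:

```yaml
groups:
  - name: loki-scaling
    rules:
      - alert: LokiIngesterMemoryHigh
        # Threshold is an illustrative assumption; tune it to your pod limits.
        expr: loki_ingester_memory_usage_bytes > 8e9
        for: 10m
        labels:
          severity: warning
        annotations:
          summary: "Ingester memory is high; consider adding Ingester replicas."
```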

Dynamic Adjustment Mechanisms:

1. Replica-Based Dynamic Scaling:

  • When metrics like loki_ingester_memory_usage_bytes or loki_distributor_received_bytes_total indicate high load, you can dynamically increase replicas by using kubectl scale or HPA (Horizontal Pod Autoscaler) to adjust instance numbers based on real-time load.

  • Example: Use HPA to automatically scale Promtail, Ingester, or Querier instances:

    ```sh
    kubectl autoscale statefulset loki-ingester --min=3 --max=10 --cpu-percent=80
    ```

2. Sharding-Based Dynamic Scaling:

  • When metrics like loki_distributor_received_bytes_total or loki_ingester_chunk_store_writes_total show a surge in log traffic, adjust the shard_by_all_labels configuration or use the sharding parameter in Loki’s configuration to dynamically increase the number of log shards.

  • Example: Increase shard count for Distributors and Ingesters to distribute more log data across multiple Ingester nodes.
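
For the replica-based path, the `kubectl autoscale` one-liner above is equivalent to applying an HPA manifest. A minimal sketch (the StatefulSet name is an assumption based on a typical Loki install):

```yaml
apiVersion: autoscaling/v2
kind: HorizontalPodAutoscaler
metadata:
  name: loki-ingester
spec:
  scaleTargetRef:
    apiVersion: apps/v1
    kind: StatefulSet
    name: loki-ingester   # assumed StatefulSet name
  minReplicas: 3
  maxReplicas: 10
  metrics:
    - type: Resource
      resource:
        name: cpu
        target:
          type: Utilization
          averageUtilization: 80
```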

These metrics can be easily collected with Prometheus and displayed in Grafana. Combined with Loki's configuration adjustments, they enable real-time dynamic configuration optimization to ensure system performance and stability in high-concurrency environments.

graph TD
    subgraph Loki Stack
        A[Client] -->|Push Logs| B[Distributor]
        B -->|Distribute Logs| C[Ingester]
        C -->|Store Logs| D[Chunk Store]
        E[Querier] -->|Fetch Logs| D
        F[Query Frontend] -->|Distribute Queries| E
        G[Client] -->|Query Logs| F
    end

    subgraph External Systems
        H[Promtail] -->|Send Logs| A
        I[Grafana] -->|Visualize Logs| G
    end

Components Description

  • Distributor: Receives log data from clients and distributes it to the ingesters.
  • Ingester: Processes and stores log data temporarily before it is flushed to the chunk store.
  • Chunk Store: A long-term storage solution for log data, such as an object store (e.g., S3, GCS).
  • Querier: Fetches log data from the chunk store to respond to user queries.
  • Query Frontend: Distributes incoming queries to multiple queriers for load balancing and parallel processing.
  • Promtail: A log collection agent that sends logs to the Loki distributor.

Interaction Flow

  1. Log Ingestion:
    • Logs are sent from the Client to the Distributor.
    • The Distributor distributes the logs to multiple Ingesters.
    • Ingesters process and temporarily store the logs before flushing them to the Chunk Store.
  2. Log Storage:
    • Ingesters periodically flush processed logs to the Chunk Store for long-term storage.
  3. Log Querying:
    • Clients (e.g., Grafana) send queries to the Query Frontend.
    • The Query Frontend distributes the queries to multiple Queriers.
    • Queriers fetch the required log data from the Chunk Store and return it to the Client.
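
To make the ingestion step concrete, here is a hedged sketch of a client pushing one log line to Loki over HTTP. The `/loki/api/v1/push` endpoint and port 3100 are Loki defaults; the URL and labels are placeholder assumptions:

```go
// Push a single log line to Loki's HTTP push API.
package main

import (
	"bytes"
	"encoding/json"
	"fmt"
	"net/http"
	"time"
)

func main() {
	// Loki push payload: streams of label sets, each carrying
	// [<unix-nanosecond timestamp as string>, <log line>] pairs.
	payload := map[string]any{
		"streams": []map[string]any{{
			"stream": map[string]string{"job": "example"}, // placeholder labels
			"values": [][]string{
				{fmt.Sprintf("%d", time.Now().UnixNano()), "hello from client"},
			},
		}},
	}
	body, err := json.Marshal(payload)
	if err != nil {
		panic(err)
	}

	// URL assumes a local Loki listening on its default port 3100.
	resp, err := http.Post("http://localhost:3100/loki/api/v1/push",
		"application/json", bytes.NewReader(body))
	if err != nil {
		panic(err)
	}
	defer resp.Body.Close()
	fmt.Println("status:", resp.Status) // 204 No Content on success
}
```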

Optimization Actions for High-Concurrency Log Processing and Storage Scalability

graph TD
    %% Clients and Log Collection
    Client[Clients / Applications] -->|Send Logs| Promtail[Promtail]
    
    %% Ingestion Pipeline
    Promtail -->|Push Logs| Distributor[Distributor Cluster]
    Distributor -->|Distribute to| Ingesters[Ingester Cluster]
    
    %% Storage Layers
    Ingesters -->|Write to| Storage["Object Storage (S3, GCS, etc.)"]
    Ingesters -->|Maintain Temporary Data| Cache[In-Memory Cache]

    %% Query Pipeline
    Querier[Querier Cluster] -->|Fetch from Storage| Storage
    Querier -->|Retrieve from Cache| Cache
    Querier -->|Access Index| Index[Index Gateway]

    %% Compaction and Maintenance
    Compactor[Compactor] -->|Compact Data| Storage

    %% Alerting and Visualization
    Ruler[Ruler] -->|Fetch Rules| Storage
    Ruler -->|Evaluate Alerts| Querier
    Grafana[Grafana] -->|Visualize Logs| Querier
    Grafana -->|Manage Alerts| Ruler

    %% Additional Interactions
    Ingesters -->|Send Metrics| Metrics["Metrics & Monitoring"]
    Querier -->|Send Metrics| Metrics
    Distributor -->|Send Metrics| Metrics
    Promtail -->|Send Metrics| Metrics
    Ruler -->|Send Metrics| Metrics
    Compactor -->|Send Metrics| Metrics
    Grafana -->|Display Metrics| Metrics

1. Horizontal Scaling of Log Collection: Promtail

  • Action: Increase the number of Promtail instances to handle the load of log collection in a high-concurrency environment. Promtail is Loki's log collection agent, responsible for gathering logs from various nodes.
  • Implementation:
    • In a Kubernetes cluster, configure Promtail as a DaemonSet to ensure an instance runs on each node, enabling automatic scaling across all nodes for comprehensive log collection.
    • When the workload in the cluster grows, keep log collection from becoming a bottleneck: as a DaemonSet, Promtail scales automatically with the node count; if Promtail is instead run as a Deployment, use Kubernetes Horizontal Pod Autoscaling (HPA) to scale instances up or down based on log collection load.
  • Key Technology: Utilize Kubernetes load balancing to evenly distribute log traffic from different nodes across Promtail instances, combined with autoscaling for dynamic capacity.

2. Sharding and Partitioning Strategy for Loki Storage Layer

  • Action: To address storage bottlenecks, implement sharding and partitioning strategies at the Loki storage layer, distributing logs across multiple storage nodes to enhance write throughput.
  • Implementation:
    • Configure the storage layer (e.g., using S3 or MinIO) in Loki for distributed storage, using sharding and partitioning to spread logs across various storage nodes. Each node handles only part of the data, reducing write pressure on individual nodes.
    • Specify multiple storage targets in Loki's configuration, allowing horizontal scaling across multiple physical or virtual storage nodes to improve fault tolerance and storage performance.

3. Parallel Processing: Ingester

  • Action: The Ingester component in Loki is responsible for receiving and processing log data. In high-concurrency environments, increase the number of Ingester instances to enable parallel log processing.
  • Implementation:
    • Increase the number of Ingester instances, with each instance handling a portion of the log data. By introducing sharding, each Ingester processes only part of the log stream, avoiding overload on individual instances.
    • Deploy Loki using Kubernetes StatefulSets and leverage Loki's replication and consistency model to ensure log data processing continuity even if some Ingester nodes fail.

4. Monitoring and Dynamic Adjustment: Prometheus Monitoring and Scaling Strategy

  • Action: To ensure dynamic adjustment capabilities, design a real-time monitoring and auto-scaling strategy based on Prometheus.
  • Implementation:
    • Use Prometheus to monitor load metrics for each component of the Loki Stack (e.g., Promtail, Ingester, Querier), including log collection throughput and storage latency.
    • Based on monitored metrics, dynamically adjust the number of Promtail and Ingester instances, scaling up during peak periods and scaling down during lower loads to save costs.
  • Monitoring metrics