intel / workload-services-framework Goto Github PK
View Code? Open in Web Editor NEWLicense: Apache License 2.0
License: Apache License 2.0
When running the OpenSSL3-RSAMB workload with various qatsw_* test cases, the Kubernetes configuration consistently points to "qat-rsa"
v23.2
-- Setting: PLATFORM=SRF, ARCH=linux/amd64
-- Setting: REGISTRY=docker-registry.services.svc.cluster.local:5000/workload_public/openssl3-rsamb/
-- Setting: RELEASE=:v23.2
-- Setting: TIMEOUT=86400
-- Setting: BENCHMARK=OpenSSL3-RSAMB
-- Setting: BACKEND=kubernetes
During our testing of OpenSSL3-RSAMB with multiple qatsw_* test cases, we observed that the test_openssl3_rsamb_qatsw_* consistently produces RSA KPI results. An interesting observation is that the Kubernetes configuration file (kubernetes-config.yaml) is generated by ctest.sh sets the CONFIG environment variable to "qat-rsa" for all these test cases.
The below kubernetes-config.yaml is from running ctest.sh with test_openssl3_rsamb_qatsw_aes-gcm testcase,
#
# Apache v2 license
# Copyright (C) 2023 Intel Corporation
# SPDX-License-Identifier: Apache-2.0
#
apiVersion: batch/v1
kind: Job
metadata:
name: benchmark
spec:
template:
spec:
containers:
- name: benchmark
image: openssl3-rsamb-qat-sw:v23.2
imagePullPolicy: Always
env:
containers:
- name: benchmark
image: openssl3-rsamb-qat-sw:v23.2
imagePullPolicy: Always
env:
- name: CONFIG
value: "qat-rsa"
securityContext:
privileged: true
restartPolicy: Never
backoffLimit: 4
#
# Apache v2 license
# Copyright (C) 2023 Intel Corporation
# SPDX-License-Identifier: Apache-2.0
#
apiVersion: batch/v1
kind: Job
metadata:
name: benchmark
spec:
template:
spec:
containers:
- name: benchmark
image: openssl3-rsamb-qat-sw:v23.2
imagePullPolicy: Always
env:
containers:
- name: benchmark
image: openssl3-rsamb-qat-sw:v23.2
imagePullPolicy: Always
env:
- name: CONFIG
value: "qat-aes-gcm"
securityContext:
privileged: true
restartPolicy: Never
backoffLimit: 4
When building a Docker image for the HammerDB-TPCC workload using the make
command with the v23.3 release, the generated Docker image has a different name compared to the one specified in Kubernetes configuration files. This issue was not present in the v23.2 release.
The Kubernetes configuration file expects the image name to be: tpcc-mysql8031-base:v23.3
, but the make
build generates the Docker image name as: mysql8031-base:v23.3
. This discrepancy has been observed across multiple workloads, including Fio, BertLarge-PyTorch, SmartScience, Video-Structure, CDN-NGINX, 3DHuman-Pose-Estimation, SpecCpu-2017, ResNet50-PyTorch, SPDK-NVMe-o-TCP, and Istio-Envoy.
v23.3
PLATFORM=
REGISTRY=
RELEASE=:v23.3
TIMEOUT=86400
BACKEND=kubernetes
We are encountering an issue while attempting to build Docker images for the HammerDB-TPCC workload using the make
command from the workload directory. The problem lies in the inconsistency between the Docker image name generated by make
and the image name specified in our Kubernetes manifest files. In the case of v23.3 release, the Kubernetes config file expects the image name to be: tpcc-mysql8031-base:v23.3
, while the make
build process generates the Docker image name as: mysql8031-base:v23.3
.
This inconsistency has also been identified across several other workloads, including Fio, BertLarge-PyTorch, SmartScience, Video-Structure, CDN-NGINX, 3DHuman-Pose-Estimation, SpecCpu-2017, ResNet50-PyTorch, SPDK-NVMe-o-TCP, and Istio-Envoy.
The image name specified in the Kubernetes configuration file should match the Docker image name generated by the make
build process: tpcc-mysql8031-base:v23.3
.
We're trying to build docker images for the WSF external workloads using the make command with the v23.3 release, the docker image builds are failing for some of the workloads with various issues.
ERROR: Could not find a version that satisfies the requirement tornado==6.3.3
ERROR: No matching distribution found for tornado==6.3.3
ERROR: failed to solve: failed to compute cache key: failed to calculate the checksum of ref 8f74b3ec-14bb-4b3f-bf4a-2415442bc78c::s1b8k8203tvipdcv00euqyggg: "/data": not found
ERROR: failed to solve: failed to compute cache key: failed to calculate checksum of ref 7076f431-11b5-40fa-bd05-b9b81ac8589c::fheik60wiqbjlxsagh6d5d4hc: "/script": not found
Docker builds are not happening when trying to run make from the workload directory. Looks like there is a bug with Make Steps.
AssertionError: Framework is not detected correctly from the model format. This could be caused by an unsupported model or inappropriate framework installation.
Same issue as SPDK-NVMe-o-TCP
ERROR: Could not find a version that satisfies the requirement tornado==6.3.3
ERROR: No matching distribution found for tornado==6.3.3
ERROR: failed to solve: failed to compute cache key: failed to calculate the checksum of ref 8f74b3ec-14bb-4b3f-bf4a-2415442bc78c::lsi9ljjqcrg6rzsiznnqlueve: "/motion-tracking-sdk": not found
We are exploring the possibility of running WSF external workloads on a K8s pod. We are considering the 23.1 branch of WSF. Our primary objective is to determine if it's feasible to execute these workloads directly on the pod (just like a local execution), without the need to pull Docker images for running the workload.
Kubernetes EKS 1.23
Ability to run workloads on a pod without using the docker image of Terraform as it does on Jenkins jobs. Can we achieve this?
No picture shown in this readme file, looks like the image is lost https://github.com/intel/workload-services-framework/blob/main/doc/image/sgx-memory.png, I recall that I have added the picture in the original file [https://github.com/intel/workload-services-framework/blob/main/doc/setup-gramine-sgx.md].
As part of building docker images for ResNet50-PyTorch-Xeon-Public and BERTLarge-PyTorch-Xeon-Public workloads, building PyTorch-Xeon intermediary image failed while setting up Conda environment.
One of the dependency package tornado where latest versions stopped support for Python 3.7 and throwing following error
#10 27.09 ERROR: Could not find a version that satisfies the requirement tornado==6.3.2
Couple of options to resolve this
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.