WIP: TaskRun recoverable error, CreateContainerConfigError
neither resumes PipelineRun on fix, nor times out
#7807
Labels
kind/bug
Categorizes issue or PR as related to a bug.
Expected Behavior
Consider a
TaskRun
started by aPipelineRun
. When theTaskRun
encounters a recoverable error such as aCreateContainerConfigError
, thePipelineRun
should either resume running if theTaskRun
is fixed, or fail due to timeout if problem is not fixed in time.Actual Behavior
In the case when a
PipelineRun
starts aTaskRun
, and aCreateContainerConfigError
occurs, theTaskRun
status.conditions[].status
is"False"
(failed). However, theTaskRun
status.steps[]
indicate it is still in waiting. According to this Tekton design table, theTaskRun
is in a recoverable state.You can tell its in a recoverable state because the status has no
status.completionTime
, and in the steps, the state is inwaiting
, notterminated
:TaskRun
Although the
TaskRun
is in a recoverable state, thePipelineRun
has already terminated. There doesn't seem to be a way to recover from the failed state:PipelineRun
When the
oci-store
is provided, ... <explain how recovery doesn't happen>When pod timeout is adjusted to timeout, ... <explain how pipelinerun timeout won't happen>
Steps to Reproduce the Problem
TO DO
1.
2.
3.
Additional Info
Kubernetes version:
Output of
kubectl version
:Tekton Pipeline version:
Output of
tkn version
orkubectl get pods -n tekton-pipelines -l app=tekton-pipelines-controller -o=jsonpath='{.items[0].metadata.labels.version}'
The text was updated successfully, but these errors were encountered: