condor - HTCondor: High Throughput Computing

Website: https://htcondor.org/
License: Apache-2.0
Description:
HTCondor is a specialized workload management system for
compute-intensive jobs. Like other full-featured batch systems, HTCondor
provides a job queuing mechanism, scheduling policy, priority scheme,
resource monitoring, and resource management. Users submit their
serial or parallel jobs to HTCondor, which places them into a queue,
chooses when and where to run them based on a policy, monitors their
progress, and informs the user upon completion.

Packages

condor-24.0.2-1.el9.src [11.7 MiB] Changelog by Tim Theisen (2024-11-26):
- Add STARTER_ALWAYS_HOLD_ON_OOM to minimize confusion about memory usage
- Fix bug that caused condor_ssh_to_job sftp and scp modes to fail
- Fix KeyboardIdle attribute in dynamic slots that could prevent job start
- No longer signals the OAuth credmon when there is no work to do
- Fix rare condor_schedd crash when a $$() macro could not be expanded
- By default, put Docker jobs on hold when CPU architecture doesn't match
condor-24.0.2-0.766510.el9.src [11.7 MiB] Changelog by Tim Theisen (2024-10-31):
- Improved tracking and enforcement of disk usage by using LVM
- Enhancements to the htcondor CLI tool
- cgroup v2 support for tracking and enforcement of CPU and memory usage
- Leverage cgroups to hide GPUs not allocated to the job
- DAGMan can now produce job credentials when using direct submit
- New submit commands to aid in matching specific GPU requirements
- New implementation of the Python bindings, htcondor2 and classad2
- Improved default security configuration
- Significant reduction in memory and CPU usage on the Central Manager
- Support for GPUs using AMD's HIP 6 library
- Fix bugs when -divide or -repeat was used in GPU detection
- Proper error message and hold when Docker emits multi-line error message
- Fix issue where an unresponsive libvirtd blocked an EP from starting up
condor-24.0.1-1.el9.src [11.7 MiB] Changelog by Tim Theisen (2024-10-31):
- Improved tracking and enforcement of disk usage by using LVM
- Enhancements to the htcondor CLI tool
- cgroup v2 support for tracking and enforcement of CPU and memory usage
- Leverage cgroups to hide GPUs not allocated to the job
- DAGMan can now produce job credentials when using direct submit
- New submit commands to aid in matching specific GPU requirements
- New implementation of the Python bindings, htcondor2 and classad2
- Improved default security configuration
- Significant reduction in memory and CPU usage on the Central Manager
- Support for GPUs using AMD's HIP 6 library
- Fix bugs when -divide or -repeat was used in GPU detection
- Proper error message and hold when Docker emits multi-line error message
- Fix issue where an unresponsive libvirtd blocked an EP from starting up
condor-24.0.1-0.764240.el9.src [11.7 MiB] Changelog by Tim Theisen (2024-10-24):
- Bug fix for PID namespaces and condor_ssh_to_job on EL9
- Augment condor_upgrade_check to find unit suffixes in ClassAd expressions
condor-24.0.1-0.762642.el9.src [11.7 MiB] Changelog by Tim Theisen (2024-10-10):
- Backport all cgroup v2 fixes and enhancements from the 23.10.1 release
