Skip to navigation

Enhancement Advisory Red Hat Enterprise MRG Messaging and Grid Version 1.2

Advisory: RHEA-2009:1633-1
Type: Product Enhancement Advisory
Severity: N/A
Issued on: 2009-12-03
Last updated on: 2009-12-03
Affected Products: Red Hat Enterprise MRG v1 for Red Hat Enterprise Linux (version 5)

Details

Red Hat Enterprise MRG Messaging and Grid Update 1.2 is now available for
Red Hat Enterprise Linux 5.
This new release includes updated classads, condor, condor-ec2-enhanced,
condor-ec2-enhanced-hooks, condor-job-hooks, condor-low-latency,
condor-remote-configuration, python-qpid, qpid-java, qpidc, rhm,
mrg-grid-docs and sesame packages that fix bugs and add enhancements.

Red Hat Enterprise MRG (Messaging, Real-time and Grid) is a real-time IT
infrastructure for enterprise computing. MRG Messaging implements the
Advanced Message Queuing Protocol (AMQP) standard, adding persistence
options, kernel optimizations, and operating system services.

This update moves Red Hat Enterprise MRG to version 1.2 and includes
numerous bug fixes and enhancements.

Messaging enhancements include:

- access control lists (ACLs) can now be used when defining a queue with
the MRG Messaging Python client. (BZ#523794)

Messaging bug fixes include:

- when a node in a cluster failed, and was then brought back up, it was
attempting to restore using information from both the store, and the
running master node. This resulted in the node that was attempting to
rejoin failing. This has been corrected, so that only the first node
started in a cluster will restore from the store. All subsequent nodes
added to the cluster will discard the store data and will synchronize with
the master node in the cluster. Rejoining a running cluster now operates as
expected. (483807)
- some recovered flow-to-disk messages on replicated queues could cause an
invalid frame sequence to be sent over the bridge that is transferring
replication events. This caused an "unexpected command continuation frame"
error on the replication link. The frame sequencing order was corrected and
the link now conveys replication events as normal. (514563)
- if a queue had multiple consumers and a new member was added to the
queue, it inconsistently allocated messages to consumers. This could result
in one node delivering a message of which the shadow sessions were unaware.
Some nodes would then record an error and shut down on detecting the
inconsistency. The update process was corrected and the inconsistent
delivery no longer occurs. This prevents errors, and all nodes remain in
the cluster as expected. (516501)

Grid enhancements include:

- the Job Router now supports multiple hook keywords to the Job Router.
The JOB_ROUTER_HOOK_KEYWORD configuration option is used to define the
default hook keyword. The hook keyword is used for jobs routed by the job
router. (463430)
- messages produced by carod now carry the same MessageId as exists in work
submitting AMQP messages. This will assist in correlation by submitter.
(489009)
- a new feature called vmuni has been added for configuration of the VM
universe. It allows configuring Xen and KVM virtual machines. The startd
must be restarted (not just reconfigured) after configuration has been
applied. (491237)

Grid bug fixes include:

- added SCHEDD.QMF_STOREFILE = $(SPOOL)/.schedd_storefile to the High
Availability Scheduler configuration. Jobs and schedulers no longer appear
in cumin twice after a scheduler failover. (486484)
- if the MRG Messaging broker was restarted while low-latency was running
on a Grid execute node, the low-latency daemon would stop processing jobs
and crash. The daemon now checks for disconnections and attempts to
reconnect. This prevents the daemon from crashing and will resume
processing jobs once the broker is running again. (488998)
- HA_LOCK_HOLD_TIME and HA_POLL_PERIOD had default values that could cause
a range of problems with failover. HA Schedd lock period has been
shortened. HA_LOCK_HOLD_TIME now defaults to 300 seconds, and
HA_POLL_PERIOD to 60 seconds (these parameters could be changed to lower
values if faster fail-over is required), and failover now works more
reliably. (496227)

These updated packages include many other bug fixes and enhancements.
Users are directed to the Red Hat Enterprise MRG 1.2 Release Notes for
information on these changes:

http://www.redhat.com/docs/en-US/Red_Hat_Enterprise_MRG/

All Red Hat Enterprise MRG users are advised to install this updated
package which fixes the bugs and adds the enhancements noted in the Red Hat
Enterprise MRG 1.2 Release Notes.


Solution

Before applying this update, make sure that all previously-released
errata relevant to your system have been applied.

This update is available via Red Hat Network. Details on how to use
the Red Hat Network to apply this update are available at
http://kbase.redhat.com/faq/docs/DOC-11259

Updated packages

Red Hat Enterprise MRG v1 for Red Hat Enterprise Linux (version 5)

SRPMS:
classads-1.0.4-1.el5.src.rpm
File outdated by:  RHSA-2010:0773
    MD5: e189b58c48aac29f117f06c911311ecb
condor-7.4.1-0.7.el5.src.rpm
File outdated by:  RHBA-2011:0217
    MD5: f236d6eaa039b45307e5dd41ea873b68
condor-ec2-enhanced-1.0-18.el5.src.rpm
File outdated by:  RHSA-2010:0773
    MD5: f63caec353a3f2e10bddc86dbd40ef23
condor-ec2-enhanced-hooks-1.0-19.el5.src.rpm
File outdated by:  RHSA-2010:0773
    MD5: 9d42bba8402c76cfa01355e9e8b907c6
condor-low-latency-1.0-21.el5.src.rpm
File outdated by:  RHBA-2011:0217
    MD5: 1c507b5371a522dd165a4d9484cec398
condor-remote-configuration-1.0-23.el5.src.rpm     MD5: 93278234de37387c4d5ba779d0496fae
mrg-grid-docs-1.2-1.el5.src.rpm
File outdated by:  RHSA-2010:0773
    MD5: 96bd5d911edaa20dd99fa6e70fe251f7
python-qpid-0.5.752581-4.el5.src.rpm
File outdated by:  RHBA-2011:0217
    MD5: c2268615013a49dd1febfdb8458eb520
qpid-java-0.5.751061-9.el5.src.rpm
File outdated by:  RHBA-2011:1147
    MD5: 29057f682542726d3a3557bee6173fa4
qpidc-0.5.752581-34.el5.src.rpm
File outdated by:  RHSA-2010:0756
    MD5: 89f303ad615d58454bb2c58d9ac19f51
rhm-0.5.3206-27.el5.src.rpm
File outdated by:  RHSA-2010:0756
    MD5: 3abf7e4ae51e2792e18d38339587c05d
rhm-docs-0.5.756148-2.el5.src.rpm
File outdated by:  RHSA-2010:0773
    MD5: 379383b52d3308b39137176eb6072681
sesame-0.4.3153-2.el5.src.rpm
File outdated by:  RHBA-2011:0217
    MD5: c99f0cdc9223b762ddbd44ba83e313a8
 
IA-32:
classads-1.0.4-1.el5.i386.rpm
File outdated by:  RHSA-2010:0773
    MD5: 327e87961decc8eb81e34eafc22f01be
classads-devel-1.0.4-1.el5.i386.rpm
File outdated by:  RHSA-2010:0773
    MD5: 24300abfa768b899fe79abb0d0fc6bfd
classads-static-1.0.4-1.el5.i386.rpm
File outdated by:  RHSA-2010:0773
    MD5: a891b9541d5cd2f749926cfd6ddb4d89
condor-7.4.1-0.7.el5.i386.rpm
File outdated by:  RHBA-2011:0217
    MD5: 4a8bbc217aa077944669d083b19a68e8
condor-ec2-enhanced-1.0-18.el5.noarch.rpm
File outdated by:  RHSA-2010:0773
    MD5: 10d6ebcad3bb15d24238c4849aeefb90
condor-ec2-enhanced-hooks-1.0-19.el5.noarch.rpm
File outdated by:  RHSA-2010:0773
    MD5: 75250e6258e284142cd6f21cb6286ce2
condor-job-hooks-1.0-13.el5.noarch.rpm
File outdated by:  RHBA-2011:0217
    MD5: 3da67fc9fade06839e28aac197a541eb
condor-kbdd-7.4.1-0.7.el5.i386.rpm
File outdated by:  RHBA-2011:0217
    MD5: 9b853c4de257034329f88676c1aaa202
condor-low-latency-1.0-21.el5.noarch.rpm
File outdated by:  RHBA-2011:0217
    MD5: d3c8321c81d9ce0cceebfa0df18ab7a4
condor-qmf-plugins-7.4.1-0.7.el5.i386.rpm
File outdated by:  RHSA-2009:1689
    MD5: 0e6ca4bcd8140ddf3e1d48d1430ee3f8
condor-remote-configuration-1.0-23.el5.noarch.rpm     MD5: 1161c9828f5b71570d2b2c20eb5ab1b3
condor-remote-configuration-server-1.0-23.el5.noarch.rpm     MD5: ec06bc458eeba0d797dbfef947d5859e
condor-vm-gahp-7.4.1-0.7.el5.i386.rpm
File outdated by:  RHBA-2011:0217
    MD5: c7a8142e45a007bddcd78169c32d05b1
mrg-grid-docs-1.2-1.el5.noarch.rpm
File outdated by:  RHSA-2010:0773
    MD5: 3b96d2e8b03d80a8a29b68177962a344
python-condor-ec2-enhanced-hooks-common-1.0-19.el5.noarch.rpm     MD5: fc00f90dfce6f38dabaa35fb9e6abeb4
python-condor-job-hooks-common-1.0-13.el5.noarch.rpm     MD5: e80bf8d574caa4adbaaea1a9c4b4fa4c
python-qpid-0.5.752581-4.el5.noarch.rpm
File outdated by:  RHBA-2011:0217
    MD5: 4312a394c41b239ef587945d6deadbd8
qmf-0.5.752581-34.el5.i386.rpm
File outdated by:  RHBA-2011:1147
    MD5: f8a72dfdfeb2ea6f642c3a6e19257b29
qmf-devel-0.5.752581-34.el5.i386.rpm
File outdated by:  RHBA-2011:1147
    MD5: 6ac0179de98dc4f444bb79663a9efeb7
qpid-java-client-0.5.751061-9.el5.noarch.rpm
File outdated by:  RHBA-2011:1147
    MD5: b92fd4742f1b84b8c7825dfa8cbda712
qpid-java-common-0.5.751061-9.el5.noarch.rpm
File outdated by:  RHBA-2011:1147
    MD5: e6b285c706ef87b4d99b2043da34e322
qpid-java-qman-0.5.751061-9.el5.noarch.rpm     MD5: 670d1aa7deee3120305767de6f8b17c3
qpidc-0.5.752581-34.el5.i386.rpm
File outdated by:  RHSA-2010:0756
    MD5: aad9159040b9918ed3c9983bd74efd97
qpidc-devel-0.5.752581-34.el5.i386.rpm
File outdated by:  RHSA-2010:0756
    MD5: a62e2531415e1e1c9e28031ebda347cb
qpidc-perftest-0.5.752581-34.el5.i386.rpm
File outdated by:  RHSA-2010:0756
    MD5: 357dac6f7113dccea4aaf97e4eb880d4
qpidc-rdma-0.5.752581-34.el5.i386.rpm
File outdated by:  RHSA-2010:0756
    MD5: d1239114a54fec00818a2d9ed03379d9
qpidc-ssl-0.5.752581-34.el5.i386.rpm
File outdated by:  RHSA-2010:0756
    MD5: e02289f7e3bbfdb2a16f88beb4591288
qpidd-0.5.752581-34.el5.i386.rpm
File outdated by:  RHSA-2010:0756
    MD5: 40906092bfd423b461fd09484825d291
qpidd-acl-0.5.752581-34.el5.i386.rpm
File outdated by:  RHSA-2010:0756
    MD5: 3a4029761e173cddb5829b59b875df74
qpidd-cluster-0.5.752581-34.el5.i386.rpm
File outdated by:  RHSA-2010:0756
    MD5: d24609682d49f93f156bd48e4b3f60cf
qpidd-devel-0.5.752581-34.el5.i386.rpm
File outdated by:  RHSA-2010:0756
    MD5: fe0beab30775ab469fb0acd739e725b2
qpidd-rdma-0.5.752581-34.el5.i386.rpm
File outdated by:  RHSA-2010:0756
    MD5: 07c9b49d8d82170a4b4213ef1ed65f9b
qpidd-ssl-0.5.752581-34.el5.i386.rpm
File outdated by:  RHSA-2010:0756
    MD5: 3ae2c659471b0dbe89728149b582d9c4
qpidd-xml-0.5.752581-34.el5.i386.rpm
File outdated by:  RHSA-2010:0756
    MD5: 72f31a7a2d44850cbb217d8e7b20e3fc
rhm-0.5.3206-27.el5.i386.rpm
File outdated by:  RHSA-2010:0756
    MD5: 4a15e31e0ebe559591a924fcfc637140
rhm-docs-0.5.756148-2.el5.noarch.rpm
File outdated by:  RHSA-2010:0773
    MD5: 819d21849c3e39f290df7af398048a20
sesame-0.4.3153-2.el5.i386.rpm
File outdated by:  RHBA-2011:0217
    MD5: f3e6d96b9e09faac921a74de267b4196
 
x86_64:
classads-1.0.4-1.el5.x86_64.rpm
File outdated by:  RHSA-2010:0773
    MD5: 46cc6eb7a2d4f960768a2917af55f7e1
classads-devel-1.0.4-1.el5.x86_64.rpm
File outdated by:  RHSA-2010:0773
    MD5: 744150a0aba7aa93ac71e9c27fd64f62
classads-static-1.0.4-1.el5.x86_64.rpm
File outdated by:  RHSA-2010:0773
    MD5: 3b060891eb444d067d3b301cb20aae7a
condor-7.4.1-0.7.el5.x86_64.rpm
File outdated by:  RHBA-2011:0217
    MD5: b44aec3fdb585ba4fded2a49c471d7a5
condor-ec2-enhanced-1.0-18.el5.noarch.rpm
File outdated by:  RHSA-2010:0773
    MD5: 10d6ebcad3bb15d24238c4849aeefb90
condor-ec2-enhanced-hooks-1.0-19.el5.noarch.rpm
File outdated by:  RHSA-2010:0773
    MD5: 75250e6258e284142cd6f21cb6286ce2
condor-job-hooks-1.0-13.el5.noarch.rpm
File outdated by:  RHBA-2011:0217
    MD5: 3da67fc9fade06839e28aac197a541eb
condor-kbdd-7.4.1-0.7.el5.x86_64.rpm
File outdated by:  RHBA-2011:0217
    MD5: 6743ac36d9a0d852fe3963939c846438
condor-low-latency-1.0-21.el5.noarch.rpm
File outdated by:  RHBA-2011:0217
    MD5: d3c8321c81d9ce0cceebfa0df18ab7a4
condor-qmf-plugins-7.4.1-0.7.el5.x86_64.rpm
File outdated by:  RHSA-2009:1689
    MD5: 0f37d8e29da95d316f5073bce9628763
condor-remote-configuration-1.0-23.el5.noarch.rpm     MD5: 1161c9828f5b71570d2b2c20eb5ab1b3
condor-remote-configuration-server-1.0-23.el5.noarch.rpm     MD5: ec06bc458eeba0d797dbfef947d5859e
condor-vm-gahp-7.4.1-0.7.el5.x86_64.rpm
File outdated by:  RHBA-2011:0217
    MD5: 2afb7c5a12107829218b0c17e65b0200
mrg-grid-docs-1.2-1.el5.noarch.rpm
File outdated by:  RHSA-2010:0773
    MD5: 3b96d2e8b03d80a8a29b68177962a344
python-condor-ec2-enhanced-hooks-common-1.0-19.el5.noarch.rpm     MD5: fc00f90dfce6f38dabaa35fb9e6abeb4
python-condor-job-hooks-common-1.0-13.el5.noarch.rpm     MD5: e80bf8d574caa4adbaaea1a9c4b4fa4c
python-qpid-0.5.752581-4.el5.noarch.rpm
File outdated by:  RHBA-2011:0217
    MD5: 4312a394c41b239ef587945d6deadbd8
qmf-0.5.752581-34.el5.x86_64.rpm
File outdated by:  RHBA-2011:1147
    MD5: 5a078ad8e4d8d57052fbf906cf0413a3
qmf-devel-0.5.752581-34.el5.x86_64.rpm
File outdated by:  RHBA-2011:1147
    MD5: 42c1caa9be50ee63b3cb5cf8453ccfe8
qpid-java-client-0.5.751061-9.el5.noarch.rpm
File outdated by:  RHBA-2011:1147
    MD5: b92fd4742f1b84b8c7825dfa8cbda712
qpid-java-common-0.5.751061-9.el5.noarch.rpm
File outdated by:  RHBA-2011:1147
    MD5: e6b285c706ef87b4d99b2043da34e322
qpid-java-qman-0.5.751061-9.el5.noarch.rpm     MD5: 670d1aa7deee3120305767de6f8b17c3
qpidc-0.5.752581-34.el5.x86_64.rpm
File outdated by:  RHSA-2010:0756
    MD5: d64e66d8d2d243fef5d38cef3a38c46a
qpidc-devel-0.5.752581-34.el5.x86_64.rpm
File outdated by:  RHSA-2010:0756
    MD5: d8424be3a9fdbc3756880320ebd3b247
qpidc-perftest-0.5.752581-34.el5.x86_64.rpm
File outdated by:  RHSA-2010:0756
    MD5: f98629d5554bb5a1becadee05beb8f61
qpidc-rdma-0.5.752581-34.el5.x86_64.rpm
File outdated by:  RHSA-2010:0756
    MD5: c9d1f17fcfe55f43e3c1e37e8549746a
qpidc-ssl-0.5.752581-34.el5.x86_64.rpm
File outdated by:  RHSA-2010:0756
    MD5: 98e6594954885d0132aedab69357bb6c
qpidd-0.5.752581-34.el5.x86_64.rpm
File outdated by:  RHSA-2010:0756
    MD5: c2ca32f27b41173cbcb68b272ee9d1cf
qpidd-acl-0.5.752581-34.el5.x86_64.rpm
File outdated by:  RHSA-2010:0756
    MD5: 33095cfd03af56b82bcdface6809d3be
qpidd-cluster-0.5.752581-34.el5.x86_64.rpm
File outdated by:  RHSA-2010:0756
    MD5: fc8174e89f5ef270d1afdf6472572a32
qpidd-devel-0.5.752581-34.el5.x86_64.rpm
File outdated by:  RHSA-2010:0756
    MD5: 975b835555cc85ef8ff1009862ab5930
qpidd-rdma-0.5.752581-34.el5.x86_64.rpm
File outdated by:  RHSA-2010:0756
    MD5: 033f15d357ca3543262c81ca5311d07c
qpidd-ssl-0.5.752581-34.el5.x86_64.rpm
File outdated by:  RHSA-2010:0756
    MD5: 84118461de01d787b2f6da8ae4c6dec6
qpidd-xml-0.5.752581-34.el5.x86_64.rpm
File outdated by:  RHSA-2010:0756
    MD5: aa8b65e65255b8932c61fbe1016921c7
rhm-0.5.3206-27.el5.x86_64.rpm
File outdated by:  RHSA-2010:0756
    MD5: 9599f1c06b7cf6d3ce1ca880a4962c16
rhm-docs-0.5.756148-2.el5.noarch.rpm
File outdated by:  RHSA-2010:0773
    MD5: 819d21849c3e39f290df7af398048a20
sesame-0.4.3153-2.el5.x86_64.rpm
File outdated by:  RHBA-2011:0217
    MD5: d34de5f3af91b66043a438a1590b64f5
 
(The unlinked packages above are only available from the Red Hat Network)

Bugs fixed (see bugzilla for more information)

457495 - condor_submit -dump segfaults with multiple jobs in cluster
459615 - Job Hooks do not provide correct JobStatus in ClassAds
463430 - Implement multiple hook keywords for Job Router
478475 - RFE: Add new configuration parameters for proxy_userid and proxy_passwd
480992 - hook_prepare_job.py has an error handling bug on line 62
483807 - resolve join state for store recover in cluster for joining nodes
485090 - SOAP memory leak on daemon reconfig
486484 - condor_ha_scheduler configuration missing QMF_STOREFILE
486487 - Stale .schedd_address and .schedd_classad
488996 - Type information lost when passing ad through carod
488998 - carod cannot handle broker restart
489009 - [RFE] Message Id should be propagated to messages carod sends
489557 - Crash in QMF Management Agent (c++) during connection shutdown
491237 - [RFE] condor_configure_node should have a option for configuring the virtual machine universe
491826 - unexpected output from carod with parameter --help
493340 - condor_configure_node does not prompt for HA Schedd name
493559 - Default logging level should be WARN not DEBUG
495588 - VM Universe documentation says VM_MEMORY is megabytes not bytes.
495685 - HOSTALLOW_WRITE denied between HA Schedulers
496227 - Remote Config: HA Schedd lock period too long
496432 - EventLog.lock fd leak and submission events are bogus
497854 - Condor tools only consult local address or daemon classad files
498497 - Startd crash can leak VMs
499826 - master termination not stopping HA daemon acquisition
501537 - qpidd should shut down immediately on loss of quorum.
505274 - Large durable messages that 'flow to disk', are not recovered correctly
506544 - condor_restart/reconfig can't handle all daemons
508834 - low latency test (cmd_args.py) raises exception: NameError conf_file
510135 - Cluster install openais dependency
514487 - Store capacity errors result in hung cluster
514563 - "Unexpected command continuation frame" error on replication link
514568 - [store] JERR_LFMGR_BADAEFNUMLIM thrown when using python tools to create queue containing 16 or more journal files
514686 - Flow to disk on replicated queues can crash broker
514730 - cron_day_of_week values in job files are ignored
516501 - Inconsistent error from session-completed on two node cluster with JMS clients
518236 - EC2E generated queue names too long
518281 - condor_restart can't handle HA Central Manager
518301 - condor_status -any doesn't show a collector
518410 - Seg fault in clustered broker where ttl is used in conjunction with lvq
518603 - Closing a connection via management command does not release sessions
519437 - Race condition can cause crash in collector and schedd if a query workers are enabled
522292 - DAGMan/DAG submission version compatibility improvements
522495 - Bug in ACL processing of exchangeBound()
522502 - ACL processing does not handle wildcards correctly
522506 - ACL issue - binding allowed to a queue when it should be denied
523242 - Missing newline in condor_status -negotiator output
523245 - StarterLog complains of deprecated config: EXECUTE_LOGIN_IS_DEDICATED
523482 - Slots left Claimed Idle when there are no jobs
523806 - No-op syscalls create schedd bottlenecks on shared file systems
524664 - Large messages cause hangs and crashes when using digest-md5 and security layer (ssf 128)
525470 - VM configuration status changes are not immediately visible in condor_status
525813 - Move Flow to disk from BDB to journal
526847 - condor_startd SEGV when deleting dynamic slots
526868 - connecting to a disabled network address takes too long to fail
527801 - 2GB filesize limit for log files
528015 - Crash in condor_schedd/QMF Agent on handling SIGQUIT
528067 - Expected job lease duration != measured
528544 - condor_master hangs on shutdown
529489 - channel collision renders pre-existing session unusable on client
530571 - HOSTALLOW -> ALLOW in configuration
530635 - Rename XEN_BRIDGE_SCRIPT & XEN_SCRIPT to VM_BRIDGE_SCRIPT & VM_SCRIPT
533361 - Released message can hamper correct browsing of the queue
533431 - inconsistent message positions on node that joins cluster when unacknowledged messages exist


References



These packages are GPG signed by Red Hat for security. Our key and details on how to verify the signature are available from:
https://www.redhat.com/security/team/key/#package

The Red Hat security contact is secalert@redhat.com. More contact details at http://www.redhat.com/security/team/contact/