Newer
Older
(see http://wiki.lustre.org/index.php?title=Patchless_Client)
2.6.16 - 2.6.21 vanilla (kernel.org)
* Recommended e2fsprogs version: 1.40.4-cfs1
* RHEL 4 and RHEL 5/SLES 10 clients behaves differently on 'cd' to a
removed cwd "./" (refer to Bugzilla 14399).
Severity : normal
Bugzilla : 13375
Descriptoin: make lov_create() will not stuck in obd_statfs_rqset()
Details : If an OST is down the MDS will hang indefinitely in
obd_statfs_rqset() waiting for the statfs data. While for
MDS QOS usage of statfs, it should not stuck in waiting.
Severity : enhancement
Bugzilla : 14288
Description: Update to RHEL4 U6 kernel-2.6.9-67.EL.
Severity : enhancement
Bugzilla : 14368
Description: Update to RHEL5 latest kernel-2.6.18-53.1.4.el5.
Severity : normal
Bugzilla : 14136
Description: make mgs_setparam() handle fsname containing dash
Details : fsname containing a dash does not work with lctl conf_param
Severity : enhancement
Bugzilla : 14388
Description: Update to SLES10 SP1 latest kernel-2.6.16.54-0.2.3.
Severity : enhancement
Bugzilla : 14289
Description: Update to RHEL5 Update-1 kernel 2.6.18-53.el5.
Severity : major
Bugzilla : 14260
Frequency : rare, at shutdown
Description: access already free / zero obd_namespace.
Details : if client_disconnect_export was called without force flag set,
and exist connect request in flight, this can produce access to
NULL pointer (or already free pointer) when connect_interpret
store ocd flags in obd_namespace.
Severity : minor
Bugzilla : 14418
Frequency : only at startup
Description: not alloc memory with spinlock held.
Details : allocation memory with GFP_KERNEL can produce sleep deadlock,
Severity : enhancement
Bugzilla : 12211
Description: make lustre randomly fail allocating memory
Details : Make lustre randomly failed allocating memory for testing purpose.
Severity : enhancement
Bugzilla : 12702
Description: lost problems with lov objid file
Details : Fixes some scability and access to not inited memory problems
Severity : major
Frequency : always
Bugzilla : 14270
Description: lfs find does not continue on file error
Details : Continue other files processing when a file/dir is absent.
Severity : normal
Bugzilla : 11791
Description: Inconsistent usage of lustre_pack_reply()
Details : Standardize the usage of lustre_pack_reply() such that it
Severity : major
Frequency : occasional
Bugzilla : 13917
Description: MDS hang or stay in waiting lock
Details : If client receive lock with CBPENDING flag ldlm need send lock
cancel as separate rpc, to avoid situation when cancel request
can't processed due all i/o threads stay in wait lock.
Bugzilla : 13969
Description: Update to RHEL5 kernel 2.6.18-8.1.15.el5.
Severity : normal
Bugzilla : 13874
Description: Update to SLES10 SP1 kernel 2.6.16.53-0.16
Severity : normal
Bugzilla : 13889
Description: Update to SLES9 kernel-2.6.5-7.287.3.
Severity : normal
Bugzilla : 14041
Description: Update to RHEL4 latest kernel.
Severity : enhancement
Bugzilla : 13690
Description: Build SLES10 patchless client fails
Details : The configure was broken by run ./configure with
--with-linux-obj=.... argument for patchless client. When the
configure use --with-linux-obj, the LINUXINCLUDE= -Iinclude
can't search header adequately. Use absolute path such as
-I($LINUX)/include instead.
Severity : normal
Bugzilla : 13888
Description: interrupt oig_wait produce painc on resend.
Details : brw_redo_request can be used for resend requests from ptlrpcd and
private set, and this produce situation when rq_ptlrpcd_data not
copyed to new allocated request and triggered LBUG on assert
req->rq_ptlrpcd_data != NULL. But this member used only for wakeup
ptlrpcd set if request is changed and can be safety changed to use
rq_set directly.
Severity : normal
Bugzilla : 13497
Description: LASSERT_{REQ,REP}SWAB macros are buggy
Details : If SWAB_PARANOIA is disabled, the LASSERT_REQSWAB and
LASSERT_REPSWAB macros become no-ops, which is incorrect. Drop
these macros and replace them with their difinitions instead.
Severity : normal
Bugzilla : 13521
Description: Update kernel patches for SLES10 2.6.16.53-0.8.
Details : Update which_patch & target file for SLES10 latest kernel.
Bugzilla : 12411
Description: Remove client patches from SLES 10 kernel.
Details : This causes SLES 10 clients to behave as patchless clients
Severity : enhancement
Bugzilla : 2262
Description: self-adjustable client's lru lists
Details : use adaptive algorithm for managing client cached locks lru
lists according to current server load, other client's work
pattern, memory activities, etc. Both, server and client
side namespaces provide number of proc tunables for controlling
things
Severity : enhancement
Bugzilla : 13641
Description: light-weight GSS support
Details : Support krb5n and krb5a mode, which keep Kerberos 5 authentication
and reduce performance overhead.
Severity : enhancement
Bugzilla : 11832
Description: Linux keyring support
Details : Support using service of Linux keyring for Lustre GSS internal
context refresh/cache mechanism.
Severity : normal
Bugzilla : 12186
Description: Fix errors in lfs documentation
Details : Fixes man pages
Severity : normal
Bugzilla : 12606
Description: don't use GFP_* in generic Lustre code.
Details : Use cfs_alloc_* functions and CFS_* flags for code portability.
Severity : normal
Bugzilla : 12333
Description: obdclass is limited by single OBD_ALLOC(idarray)
Details : replace OBD_ALLOC/OBD_FREE with OBD_VMALLOC/OBD_VFREE
Severity : normal
Bugzilla : 13006
Description: warnings with build patchless client with vanila 2.6.19 and up
Details : change old ctl_table style and replace ctl_table/ctl_table_header
with cfs_sysctl_table_t/cfs_sysctl_table_header_t
Severity : normal
Bugzilla : 13177
Frequency : Only for SLES
Description: sanity_quota fail test_1
Details : There are multiple occurences of $TSTUSR in SLES's /etc/group
Severity : normal
Bugzilla : 13249
Frequency : Only for SLES9
Description: Kernel patches for SLES9 2.6.5-7.286 kernel
Details : Update target/ChangeLog/which_patch .
Severity : normal
Bugzilla : 13170
Frequency : Only for test_10 in sanity-quota.sh
Description: a bug in quota test code
Details : checking a test flag in a wrong place
Severity : normal
Bugzilla : 13171
Frequency : Only for test_13 in sanity-quota.sh
Description: a bug in quota test script
Details : shouldn't check overfull allocated quota
Severity : normal
Bugzilla : 12955
Description: jbd statistics
Details : Port older jbd statistics patch for sles10
Severity : enhancement
Bugzilla : 11721
Description: Add printing inode info into message about error in writepage.
Severity : normal
Bugzilla : 11974
Frequency : Rare
Description: reply_lock_interpret crash due to race with it and lock cancel.
Details : Do not replay locks that are being cancelled. Do not reference
locks by their address during replay, just by their handle.
Severity : normal
Bugzilla : 13103
Frequency : When flocks are used.
Description: assertion failure in ldlm_cli_enquque_fini for non NULL lock.
Details : Flock locks might destroy just granted lock if it could be merged
with another existing flock, this is done in completion handler,
so teach ldlm_cli_enquque_fini that this is a valid case for
flock locks.
Severity : minor
Bugzilla : 13276
Frequency : rare
Description: Oops in read and write path when failing to allocate lock.
Details : Check if lock allocation failed and return error back.
Severity : normal
Bugzilla : 11679
Description: lstripe command fails for valid OST index
Details : The stripe offset is compared to lov->desc.ld_tgt_count
instead of lov->desc.ld_active_tgt_count.
Severity : normal
Bugzilla : 12584
Description: sanity.sh failed test 103
Details : RHEL mis-interpret setfacl "-X" param, so we won't test setfacl
Severity : normal
Bugzilla : 12743
Description: df doesn't work properly if diskfs blocksize != 4K
Details : Choose biggest blocksize of OST's as the LOV's blocksize.
Severity : normal
Bugzilla : 12836
Description: lfs find on -1 stripe looping in lsm_lmm_verify_common()
Details : Avoid lov_verify_lmm_common() on directory with -1 stripe count.
Severity : major
Bugzilla : 12932
Description: obd_health_check_timeout too short
Details : set obd_health_check_timeout as 1.5x of obd_timeout
Severity : normal
Bugzilla : 12192
Description: llapi_file_create() does not allow some changes
Details : add llapi_file_open() that allows specifying the mode and
open flags, and also returns an open file handle.
Severity : normal
Bugzilla : 11248
Description: merge and cleanup kernel patches.
Details : Remove mnt_lustre_list in vfs_intent-2.6-rhel4.patch.
Severity : normal
Bugzilla : 10657
Description: Add journal checksum support.(Kernel part)
Details : The journal checksum feature adds two new flags i.e
JBD2_FEATURE_COMPAT_CHECKSUM. JBD2_FEATURE_CHECKSUM flag
indicates that the commit block contains the checksum for
the blocks described by the descriptor blocks. Now commit
record can be sent to disk without waiting for descriptor
blocks to be written to disk. This behavior is controlled
using JBD2_FEATURE_ASYNC_COMMIT flag.
Severity : minor
Bugzilla : 12446
Description: OSS needs mutliple precreate threads
Details : Add ability to start more than one create thread per OSS.
Severity : normal
Bugzilla : 13362
Description: Sanity.sh test_65k failed: setstripe should have succeeded
Details : execute lfs setstripe on client
Severity : major
Bugzilla : 12223
Description: mds_obd_create error creating tmp object
Details : When the user sets quota on root, llog will be affected and can't
Severity : normal
Frequency : Always on ia64 patchless client, and possibly others.
Bugzilla : 12826
Description: Add EXPORT_SYMBOL check for node_to_cpumask symbol.
Details : This allows the patchless client to be loaded on architectures
Severity : normal
Bugzilla : 13039
Description: RedHat Update kernel for RHEL5
Details : Add the kernel config file for RHEL5.
Severity : normal
Bugzilla : 13039
Description: RedHat Update kernel for RHEL5
Details : Modify the kernel config file more closer RHEL5.
Severity : normal
Bugzilla : 13360
Description: Build failure against Centos5 (RHEL5)
Details : Define PAGE_SIZE when it isn't present.
Severity : minor
Bugzilla : 13363
Description: test_7 on sanity-quota.sh can't be used on separate machines
Details : add facet to handle it
Severity : normal
Bugzilla : 13030
Description: "ll_intent_file_open()) lock enqueue: err: -13" with nfs
Details : with NFS, the anon dentry's parent was set to itself in
d_alloc_anon(), so in MDS, we use rec->ur_fid1 to find the
corresponding dentry other than use rec->ur_name.
Severity : enhancement
Bugzilla : 12786
Description: lfs setstripe enhancement
Details : Make lfs setstripe understand 'k', 'm' and 'g' for stripe size.
Description: enable data checksumming by default
Details : enable checksum by default, allow --disable-checksum
configure option and "-o nochecksum" mount option. Checksums
can also be disabled at runtime via $LPROC/osc/*/checksum_pages.
Severity : normal
Bugzilla : 11802
Description: lustre support for RHEL5
Details : Add support for RHEL5.
Severity : normal
Bugzilla : 12459
Description: Client eviction due to failover config
Details : after a connection loss, the lustre client should attempt to
reconnect to the last active server first before trying the
other potential connections.
Severity : minor
Bugzilla : 12588
Description: when mds and osts use different quota unit(32bit and 64bit),
quota will be released repeatly.
Details : void sending multiple quota reqs to mds, which will keep the
status between the reqs.
Severity : normal
Bugzilla : 13125
Description: osts not allocated evenly to files
Details : change the condition to increase offset_idx
Severity : critical
Frequency : Always for filesystems larger than 2TB on 32-bit systems.
Bugzilla : 13547
Description: Data corruption for OSTs that are formatted larger than 2TB
on 32-bit servers.
Details : When generating the bio request for lustre file writes the
sector number would overflow a temporary variable before being
used for the IO. The data reads correctly from Lustre (which
will overflow in a similar manner) but other file data or
filesystem metadata may be corrupted in some cases.
Severity : normal
Bugzilla : 11230
Description: Tune the kernel for good SCSI performance.
Details : Set the value of /sys/block/{dev}/queue/max_sectors_kb
to the value of /sys/block/{dev}/queue/max_hw_sectors_kb
in mount_lustre.

Nikita Danilov
committed
Severity : cleanup
Bugzilla : 13532
Description: rewrite ext2-derived code in obdclass/uuid.c
Details : rewrite inherited code (uuid parsing code from ext2 utils)
from scratch preserving functionality.
Severity : normal
Bugzilla : 13600
Description: "lfs find -obd UUID" prints directories
Details : "lfs find -obd UUID" will return all directory names instead
of just file names. It is incorrect because the directories
do not reside on the OSTs.
Severity : minor
Bugzilla : 2369
Description: use i_size_read and i_size_write in 2.6 port
Details : replace inode->i_size access with i_size_read/write()
Severity : normal
Frequency : when removing large files
Bugzilla : 13181
Description: scheduling issue during removal of large Lustre files
Details : Don't take the BKL in fsfilt_ext3_setattr() for 2.6 kernels.
It causes scheduling issues when removing large files (17TB in the
present case).
Severity : normal
Frequency : only with liblustre clients on XT3
Bugzilla : 12418
Description: evictions taking too long
Details : allow llrd to evict clients directly on OSTs
Severity : normal
Frequency : only on ppc
Bugzilla : 12234
Description: /proc/fs/lustre/devices broken on ppc
Details : The patch as applied to 1.6.2 doesn't look correct for all arches.
We should make sure the type of 'index' is loff_t and then cast
explicitly as needed below. Do not assign an explicitly cast
loff_t to an int.
Severity : normal
Frequency : only for rhel5
Bugzilla : 13616
Description: Kernel patches update for RHEL5 2.6.18-8.1.10.el5.
Details : Modify the target file & which_kernel.
Severity : enhancement
Bugzilla : 10786
Description: omit set fsid for export NFS
Details : fix set/restore device id for avoid EMFILE error and mark lustre fs
as FS_REQUIRES_DEV for avoid problems with generate fsid.
Severity : normal
Bugzilla : 13304
Frequency : Always, for kernels after 2.6.16
Description: Fix warning idr_remove called for id=.. which is not allocated.
Details : Last kernels save old s_dev before kill super and not allow
to restore from callback - restore it before call kill_anon_super.
Severity : minor
Bugzilla : 12948
Description: buffer overruns could theoretically occur
Details : llapi_semantic_traverse() modifies the "path" argument by
appending values to the end of the origin string, and a
overrun may occur. Adding buffer overrun check in liblustreapi.
Bugzilla : 13334
Description: Fix error on 'ls .' at the top of the Lustre mount.
Details : Don't revalidate dentry if it is a root dentry.
Severity : normal
Bugzilla : 13518
Description: Kernel patches update for RHEL4 2.6.9-55.0.6.
Details : Modify vm-tunables-rhel4.patch.
Severity : normal
Bugzilla : 13452
Description: Kernel config for 2.6.18-vanilla.
Details : Modify targets/2.6-vanilla.target.in.
Add config file kernel-2.6.18-2.6-vanilla-i686.config.
Add config file kernel-2.6.18-2.6-vanilla-i686-smp.config.
Add config file kernel-2.6.18-2.6-vanilla-x86_64.config.
Add config file kernel-2.6.18-2.6-vanilla-x86_64-smp.config.
Severity : major
Bugzilla : 11710
Description: improve handling recoverable errors
Details : if request processig with error which can be recoverable on server
request should be resend, otherwise page released from cache and
marked as error.
Severity : critical
Bugzilla : 13751
Description: Kernel patches update for RHEL5 2.6.18-8.1.14.el5.
Details : Modify target file & which_patch.
A flaw was found in the IA32 system call emulation provided
on AMD64 and Intel 64 platforms. An improperly validated 64-bit
value could be stored in the %RAX register, which could trigger an
out-of-bounds system call table access. An untrusted local user
could exploit this flaw to run code in the kernel
(ie a root privilege escalation). (CVE-2007-4573).
Severity : major
Bugzilla : 13093
Description: O_DIRECT bypasses client statistics.
Details : When running with O_DIRECT I/O, neither the client rpc_stats nor
read_ahead_stats were updated. copied the stats section from
osc_send_oap_rpc() into async_internal().
Severity : normal
Bugzilla : 13454
Description: Add jbd statistics patch for RHEL5 and 2.6.18-vanilla
Severity : minor
Bugzilla : 13732
Description: change order of libsysio includes
Details : '#include sysio.h' should always come before '#include xtio.h'
Severity : normal
Bugzilla : 11673
Description: handle "serious error: objid * already exists" more gracefully
Details : If LAST_ID value on disk is smaller than the objects existing in
the O/0/d* directories, it indicates disk corruption and causes an
LBUG(). If the object is 0-length, then we should use the existing
object. This will help to avoid a full fsck in most cases.
Severity : enhancement
Bugzilla : 13207
Description: adapt the lustre_config script to support the upgrade case
Details : Add "-u" option for lustre_config script to support upgrading 1.4
Description: To avoid grant space > avaible space when the disk is almost
full. Without this patch you might see the error "grant XXXX >
available" or some LBUG about grant, when the disk is almost
full.
Details : In filter_check_grant, for non_grant cache write, we should
check the left space by if (*left > ungranted + bytes), instead
of (*left > ungranted), because only we are sure the left space
is enough for another "bytes", then the ungrant space should be
increase. In client, we should update cl_avail_grant only there
is OBD_MD_FLGRANT in the reply.
Severity : critical
Bugzilla : 13748
Description: Update RHEL 4 kernel to fix local root privilege escalation.
Details : Update to the latest RHEL 4 kernel to fix the vulnerability
described in CVE-2007-4573. This problem could allow untrusted
local users to gain root access.
Severity : normal
Frequency : when using O_DIRECT and quotas
Bugzilla : 13930
Description: Incorrect file ownership on O_DIRECT output files
Details : block usage reported by 'lfs quota' does not take into account
files that have been written with O_DIRECT.
Severity : normal
Frequency : always
Bugzilla : 13976
Description: touch file failed when fs is not full
Details : OST in recovery should not be discarded by MDS in alloc_qos(),
Severity : normal
Bugzilla : 11301
Description: parallel lock callbacks
Details : Instead of sending blocking and completion callbacks as separated
Severity : normal
Frequency : only for Cray XT3
Bugzilla : 12829/13455
Description: Changing primary group doesn't change the group lustre assigns to
a file
Details : When CRAY_XT3 is defined, the fsgid supplied by the client is
overridden with the primary group provided by the group upcall,
whereas the supplied fsgid can be trusted if it is in the list of
supplementary groups returned by the group upcall.
Severity : enhancement
Bugzilla : 14398
Description: Allow masking D_WARNING, D_ERROR messages from console
Details : Console messages can now be disabled via lnet.printk.
--------------------------------------------------------------------------------
2007-08-10 Cluster File Systems, Inc. <info@clusterfs.com>
* version 1.6.1
* Support for kernels:
2.6.5-7.283 (SLES 9),
2.6.9-55.EL (RHEL 4),
2.6.16.46-0.14 (SLES 10),
2.6.18.8 vanilla (kernel.org)
* Client support for unpatched kernels:
(see http://wiki.lustre.org/index.php?title=Patchless_Client)
2.6.16 - 2.6.22 vanilla (kernel.org)
* Due to recently discovered recovery problems, we do not recommend
using patchless RHEL 4 clients with this or any earlier release.
* Recommended e2fsprogs version: 1.39.cfs8
* Note that reiserfs quotas are disabled on SLES 10 in this kernel.
* Starting with this release, the ldiskfs backing filesystem required
by Lustre is now in its own package, lustre-ldiskfs. This package
should be installed. It is versioned separately from Lustre and
may be released separately in future.
Severity : minor
Bugzilla : 13147
Description: block reactivating mgc import until all deactivates complete
Details : Fix race when failing back MDT/MGS to itself (testing)
Severity : minor
Frequency : at statup only
Bugzilla : 12860
Description: mds_lov_synchronize race leads to various problems
Details : simultaneous MDT->OST connections at startup can cause the
sync to abort, leaving the OSC in a bad state.
616
617
618
619
620
621
622
623
624
625
626
627
628
629
630
631
632
633
634
635
636
637
638
639
640
641
642
643
644
645
646
647
648
649
650
651
652
653
654
655
656
657
658
659
660
661
662
Severity : enhancement
Bugzilla : 12194
Description: add optional extra BUILD_VERSION info
Details : add a new environment variable (namely LUSTRE_VERS) which allows
to override the lustre version.
Severity : normal
Frequency : 2.6.18 servers only
Bugzilla : 12546
Description: ll_kern_mount() doesn't release the module reference
Details : The ldiskfs module reference count never drops down to 0
because ll_kern_mount() doesn't release the module reference.
Severity : normal
Frequency : rare
Bugzilla : 12470
Description: server LBUG when using old ost_num_threads parameter
Details : Accept the old ost_num_threads parameter but warn that it
is deprecated, and fix an off-by-one error that caused an LBUG.
Severity : normal
Frequency : rare
Bugzilla : 11722
Description: Transient SCSI error results in persistent IO issue
Details : iobuf->dr_error is not reinitialized to 0 between two
uses.
Severity : normal
Frequency : sometimes when underlying device returns I/O errors
Bugzilla : 11743
Description: OSTs not going read-only during write failures
Details : OSTs are not remounted read-only when the journal commit threads
get I/O errors because fsfilt_ext3 calls journal_start/stop()
instead of the ext3 wrappers.
Severity : minor
Bugzilla : 12364
Description: poor connect scaling with increasing client count
Details : Don't run filter_grant_sanity_check for more than 100 exports
to improve scaling for large numbers of clients.
Severity : normal
Frequency : SLES10 only
Bugzilla : 12538
Description: sanity-quota.sh quotacheck failed: rc = -22
Details : Quotas cannot be enabled on SLES10.
Severity : normal
Frequency : liblustre clients only
Bugzilla : 12229
Description: getdirentries does not give error when run on compute nodes
Details : getdirentries does not fail when the size specified as an argument
is too small to contain at least one entry
Severity : enhancement
Bugzilla : 11548
Description: Add LNET router traceability for debug purposes
Details : If a checksum failure occurs with a router as part of the
IO path, the NID of the last router that forwarded the bulk data
Severity : normal
Frequency : rare
Bugzilla : 11315
Description: OST "spontaneously" evicts client; client has imp_pingable == 0
Details : Due to a race condition, liblustre clients were occasionally
Description: lfs setstripe use optional parameters instead of postional
Description: Nanosecond timestamp support for ldiskfs
Details : The on-disk ldiskfs filesystem has added support for nanosecond
resolution timestamps. There is not yet support for this at
the Lustre filesystem level.
Severity : normal
Frequency : during server recovery
Bugzilla : 11203
Description: MDS failing to send precreate requests due to OSCC_FLAG_RECOVERING
Details : request with rq_no_resend flag not awake l_wait_event if they get a
Severity : minor
Frequency : nfs export on patchless client
Bugzilla : 11970
Description: connectathon hang when test nfs export over patchless client
Details : Disconnected dentry cannot be found with lookup, so we do not need
Bugzilla : 11757
Description: fix llapi_lov_get_uuids() to allow many OSTs to be returned
Details: : Change llapi_lov_get_uuids() to read the UUIDs from /proc instead
of using an ioctl. This allows lfsck for > 160 OSTs to succeed.
Severity : minor
Frequency : rare
Bugzilla : 11546
Description: open req refcounting wrong on reconnect
Details : If reconnect happened between getting open reply from server and
call to mdc_set_replay_data in ll_file_open, we will schedule
replay for unreferenced request that we are about to free.
Subsequent close will crash in variety of ways.
Check that request is still eligible for replay in
mdc_set_replay_data().
Severity : minor
Frequency : rare
Bugzilla : 11512
Description: disable writes to filesystem when reading health_check file
Details : the default for reading the health_check proc file has changed
to NOT do a journal transaction and write to disk, because this
can cause reads of the /proc file to hang and block HA state
checking on a healthy but otherwise heavily loaded system. It
is possible to return to the previous behaviour during configure
with --enable-health-write.
737
738
739
740
741
742
743
744
745
746
747
748
749
750
751
752
753
754
755
756
757
758
759
760
761
762
763
764
765
766
767
768
769
770
771
772
773
774
775
776
777
778
779
780
781
782
783
784
785
786
787
788
789
790
791
792
793
794
795
796
797
798
799
800
801
802
803
804
805
806
807
808
809
810
811
812
813
814
815
816
817
818
819
820
821
822
823
824
825
826
827
828
829
830
831
832
833
834
835
836
837
838
839
840
841
842
843
844
845
846
847
848
849
850
851
852
853
854
855
856
857
858
859
860
861
862
863
864
865
866
867
868
869
870
871
872
873
874
875
876
877
878
Severity : enhancement
Bugzilla : 10768
Description: 64-bit inode version
Details: : Add a on-disk 64-bit inode version for ext3 to track changes made
to the inode. This will be required for version-based recovery.
Severity : normal
Frequency : rare
Bugzilla : 11818
Description: MDS fails to start if a duplicate client export is detected
Details : in some rare cases it was possible for a client to connect to
an MDS multiple times. Upon recovery the MDS would detect this
and fail during startup. Handle this more gracefully.
Severity : enhancement
Bugzilla : 11563
Description: Add -o localflock option to simulate old noflock
behaviour.
Details : This will achieve local-only flock/fcntl locks
coherentness.
Severity : minor
Frequency : rare
Bugzilla : 11658
Description: log_commit_thread vs filter_destroy race leads to crash
Details : Take import reference before releasing llog record semaphore
Severity : normal
Frequency : rare
Bugzilla : 12477
Description: Wrong request locking in request set processing
Details : ptlrpc_check_set wrongly uses req->rq_lock for proctect add to
imp_delayed_list, in this place should be used imp_lock.
Severity : normal
Frequency : when reconnection
Bugzilla : 11662
Description: Grant Leak when osc reconnect to OST
Details : When osc reconnect ost, OST(filter) should check whether it
should grant more space to client by comparing fed_grant and
cl_avail_grant, and return the granted space to client instead
of "new granted" space, because client will call osc_init_grant
to update the client grant space info.
Severity : normal
Frequency : when client reconnect to OST
Bugzilla : 11662
Description: Grant Leak when osc do resend and replay bulk write
Details : When osc reconnect to OST, OST(filter)should clear grant info of
bulk write request, because the grant info while be sync between
OSC and OST when reconnect, and we should ignore the grant info
these of resend/replay write req.
Severity : normal
Frequency : rare
Bugzilla : 11662
Description: Grant space more than avaiable left space sometimes.
Details : When then OST is about to be full, if two bulk writing from
different clients came to OST. Accord the avaliable space of the
OST, the first req should be permitted, and the second one
should be denied by ENOSPC. But if the seconde arrived before
the first one is commited. The OST might wrongly permit second
writing, which will cause grant space > avaiable space.
Severity : normal
Frequency : when client is evicted
Bugzilla : 12371
Description: Grant might be wrongly erased when osc is evicted by OST
Details : when the import is evicted by server, it will fork another
thread ptlrpc_invalidate_import_thread to invalidate the
import, where the grant will be set to 0. While the original
thread will update the grant it got when connecting. So if
the former happened latter, the grant will be wrongly errased
because of this race.
Severity : normal
Frequency : rare
Bugzilla : 12401
Description: Checking Stale with correct fid
Details : ll_revalidate_it should uses de_inode instead of op_data.fid2
to check whether it is stale, because sometimes, we want the
enqueue happened anyway, and op_data.fid2 will not be initialized.
Severity : enhancement
Bugzilla : 11647
Description: update patchless client
Details : Add support for patchless client with 2.6.20, 2.6.21 and RHEL 5
Severity : normal
Frequency : only with 2.4 kernel
Bugzilla : 12134
Description: random memory corruption
Details : size of struct ll_inode_info is to big for union inode.u and this
can be cause of random memory corruption.
Severity : normal
Frequency : rare
Bugzilla : 10818
Description: Memory leak in recovery
Details : Lov_mds_md was not free in an error handler in mds_create_object.
It should also check obd_fail before fsfilt_start, otherwise if
fsfilt_start return -EROFS,(failover mds during mds recovery).
then the req will return with repmsg->transno = 0 and rc = EROFS.
and we met hit the assert LASSERT(req->rq_reqmsg->transno ==
req->rq_repmsg->transno) in ptlrpc_replay_interpret. Fcc should
be freed no matter whether fsfilt_commit success or not.
Severity : minor
Frequency : only with huge count clients
Bugzilla : 11817
Description: Prevents from taking the superblock lock in llap_from_page for
a soon died page.
Details : using LL_ORIGIN_REMOVEPAGE origin flag instead of LL_ORIGIN_UNKNOW
for llap_from_page call in ll_removepage prevents from taking the
superblock lock for a soon died page.
Severity : normal
Frequency : rare
Bugzilla : 11935
Description: Not check open intent error before release open handle
Details : in some rare cases, the open intent error is not checked before
release open handle, which may cause
ASSERTION(open_req->rq_transno != 0), because it tries to release
the failed open handle.
Severity : normal
Frequency : rare
Bugzilla : 12556
Description: Set cat log bitmap only after create log success.
Details : in some rare cases, the cat log bitmap is set too early. and it
should be set only after create log success.
Severity : major
Bugzilla : 11971
Description: Accessing a block bevice can re-enable I/O when Lustre is
tearing down a device.
Details : dev_clear_rdonly(bdev) must be called in kill_bdev() instead of
blkdev_put().
Severity : minor
Bugzilla : 11706
Description: service threads may hog cpus when there are a lot of requests
Details : Insert cond_resched to give other threads a chance to use some CPU
Severity : normal
Frequency : rare
Bugzilla : 12086
Description: the cat log was not initialized in recovery
Details : When mds(mgs) do recovery, the tgt_count might be zero, so the
unlink log on mds will not be initialized until mds post
recovery. And also in mds post recovery, the unlink log will
initialization will be done asynchronausly, so there will be race
between add unlink log and unlink log initialization.
Severity : normal
Bugzilla : 12597
Description: brw_stats were being printed incorrectly
Details : brw_stats were being printed as log2 but all of them were not
recorded as log2. Also remove some code duplication arising from
filter_tally_{read,write}.
Severity : normal
Bugzilla : 11674
Frequency : rare, only in recovery.
Description: ASSERTION(req->rq_type != LI_POISON) failed
Details : imp_lock should be held while iterating over imp_sending_list for
prevent destroy request after get timeout in ptlrpc_queue_wait.
Severity : normal
Bugzilla : 12689
Description: replay-single.sh test 52 fails
Details : A lock's skiplist need to be cleanup when it being unlinked
Severity : normal
Bugzilla : 11737
Description: Short directio read returns full requested size rather than
Details : Direct I/O operations should return actual amount of bytes
Severity : enhancement
Bugzilla : 10589
Description: metadata RPC reduction (e.g. for rm performance)
Details : decrease the amount of synchronous RPC between clients and servers
by canceling conflicing lock before the operation on the client
and packing thier handles into the main operation RPC to server.
Severity : enhancement
Bugzilla : 4900
Description: Async OSC create to avoid the blocking unnecessarily.
Details : If a OST has no remain object, system will block on the creating
when need to create a new object on this OST. Now, ways use
pre-created objects when available, instead of blocking on an
empty osc while others are not empty. If we must block, we block
for the shortest possible period of time.
Severity : enhancement
Bugzilla : 12702
Description: refine locking for avoid write wrong info into lov_objid file
Details : fix possible races with add new target and write/update data in
lov_objid file.
--------------------------------------------------------------------------------
2007-05-03 Cluster File Systems, Inc. <info@clusterfs.com>
* version 1.6.0.1
* bug fixes
Severity : normal
Bugzilla : 12404
Description: 1.6 client sometimes fails to mount from a 1.4 MDT
Details : Uninitialized flags sometimes cause configuration commands to
be skipped.
Severity : normal
Frequency : patchless clients only
Bugzilla : 12391
Description: missing __iget() symbol export
Details : The __iget() symbol export is missing. To avoid the need for
this on patchless clients the deathrow inode reaper is turned
off, and we depend on the VM to clean up old inodes. This
dependency was during via the fix for bug 12181.
--------------------------------------------------------------------------------
* version 1.6.0
* CONFIGURATION CHANGE. This version of Lustre WILL NOT
INTEROPERATE with older versions automatically. In many cases a
special upgrade step is needed. Please read the
user documentation before upgrading any part of a live system.
* WIRE PROTOCOL CHANGE from previous 1.6 beta versions. This
version will not interoperate with 1.6 betas before beta5 (1.5.95).
* WARNING: Lustre configuration and startup changes are required with
this release. See https://mail.clusterfs.com/wikis/lustre/MountConf
for details.
Severity : enhancement
Bugzilla : 8007
Description: MountConf
Details : Lustre configuration is now managed via mkfs and mount
commands instead of lmc and lconf. New obd types (MGS, MGC)
are added for dynamic configuration management. See
https://mail.clusterfs.com/wikis/lustre/MountConf for
details.
Severity : enhancement
Bugzilla : 4482
Description: dynamic OST addition
Details : OSTs can now be added to a live filesystem
Severity : enhancement
Bugzilla : 9851
Description: startup order invariance
Details : MDTs and OSTs can be started in any order. Clients only
require the MDT to complete startup.