Skip to content
Snippets Groups Projects
ChangeLog 185 KiB
Newer Older
scjody's avatar
scjody committed
tbd  Sun Microsystems, Inc.
scjody's avatar
scjody committed
       * version 1.8.0
Andreas Dilger's avatar
Andreas Dilger committed
       * Support for kernels:
Yang Sheng's avatar
Yang Sheng committed
        2.6.9-67.EL (RHEL 4),
Yang Sheng's avatar
Yang Sheng committed
        2.6.16.54-0.2.3 (SLES 10),
Yang Sheng's avatar
Yang Sheng committed
        2.6.18-53.1.4.el5 (RHEL 5).
Andreas Dilger's avatar
Andreas Dilger committed
       * Client support for unpatched kernels:
        (see http://wiki.lustre.org/index.php?title=Patchless_Client)
        2.6.16 - 2.6.21 vanilla (kernel.org)
       * Recommended e2fsprogs version: 1.40.4-cfs1
scjody's avatar
scjody committed
       * Note that reiserfs quotas are disabled on SLES 10 in this kernel.
Bobi Jam's avatar
Bobi Jam committed
       * RHEL 4 and RHEL 5/SLES 10 clients behaves differently on 'cd' to a
        removed cwd "./" (refer to Bugzilla 14399).
scjody's avatar
scjody committed

Bobi Jam's avatar
Bobi Jam committed
Severity   : normal
Bugzilla   : 13375
Descriptoin: make lov_create() will not stuck in obd_statfs_rqset()
Details    : If an OST is down the MDS will hang indefinitely in
             obd_statfs_rqset() waiting for the statfs data. While for
	     MDS QOS usage of statfs, it should not stuck in waiting.

Yang Sheng's avatar
Yang Sheng committed
Severity   : enhancement
Bugzilla   : 14288
Description: Update to RHEL4 U6 kernel-2.6.9-67.EL.

Yang Sheng's avatar
Yang Sheng committed
Severity   : enhancement
Bugzilla   : 14368
Description: Update to RHEL5 latest kernel-2.6.18-53.1.4.el5.

Bobi Jam's avatar
Bobi Jam committed
Severity   : normal
Bugzilla   : 14136
Description: make mgs_setparam() handle fsname containing dash
Details    : fsname containing a dash does not work with lctl conf_param

Yang Sheng's avatar
Yang Sheng committed
Severity   : enhancement
Bugzilla   : 14388
Description: Update to SLES10 SP1 latest kernel-2.6.16.54-0.2.3.

Yang Sheng's avatar
Yang Sheng committed
Severity   : enhancement
Bugzilla   : 14289
Description: Update to RHEL5 Update-1 kernel 2.6.18-53.el5.

Severity   : major
Bugzilla   : 14260
Frequency  : rare, at shutdown
Description: access already free / zero obd_namespace.
Details    : if client_disconnect_export was called without force flag set,
Andreas Dilger's avatar
Andreas Dilger committed
	     and exist connect request in flight, this can produce access to
	     NULL pointer (or already free pointer) when connect_interpret 
	     store ocd flags in obd_namespace.

Severity   : minor
Bugzilla   : 14418
Frequency  : only at startup
Description: not alloc memory with spinlock held.
Details    : allocation memory with GFP_KERNEL can produce sleep deadlock,
Andreas Dilger's avatar
Andreas Dilger committed
	     if any spinlock held.
Bobi Jam's avatar
Bobi Jam committed
Severity   : enhancement
Bugzilla   : 12211
Description: make lustre randomly fail allocating memory
Details    : Make lustre randomly failed allocating memory for testing purpose.

Severity   : enhancement
Bugzilla   : 12702
Description: lost problems with lov objid file
Details    : Fixes some scability and access to not inited memory problems 
Andreas Dilger's avatar
Andreas Dilger committed
	     in work with lov objdid file.
Bobi Jam's avatar
Bobi Jam committed
Severity   : major
Frequency  : always
Bugzilla   : 14270
Description: lfs find does not continue on file error
Details    : Continue other files processing when a file/dir is absent.

Bobi Jam's avatar
Bobi Jam committed
Severity   : normal
Bugzilla   : 11791
Description: Inconsistent usage of lustre_pack_reply()
Details    : Standardize the usage of lustre_pack_reply() such that it
Andreas Dilger's avatar
Andreas Dilger committed
	     always generate a CERROR on failure.
Bobi Jam's avatar
Bobi Jam committed

Severity   : major
Frequency  : occasional
Bugzilla   : 13917
Description: MDS hang or stay in waiting lock
Details    : If client receive lock with CBPENDING flag ldlm need send lock
	     cancel as separate rpc, to avoid situation when cancel request
	     can't processed due all i/o threads stay in wait lock.

Yang Sheng's avatar
Yang Sheng committed
Severity   : normal
Yang Sheng's avatar
Yang Sheng committed
Bugzilla   : 13969
Description: Update to RHEL5 kernel 2.6.18-8.1.15.el5.

Severity   : normal
Yang Sheng's avatar
Yang Sheng committed
Bugzilla   : 13874
Description: Update to SLES10 SP1 kernel 2.6.16.53-0.16

Yang Sheng's avatar
Yang Sheng committed
Severity   : normal
Bugzilla   : 13889
Description: Update to SLES9 kernel-2.6.5-7.287.3.

Yang Sheng's avatar
Yang Sheng committed
Severity   : normal
Bugzilla   : 14041
Description: Update to RHEL4 latest kernel.

Yang Sheng's avatar
Yang Sheng committed
Severity   : enhancement
Bugzilla   : 13690
Description: Build SLES10 patchless client fails
Details    : The configure was broken by run ./configure with 
Andreas Dilger's avatar
Andreas Dilger committed
	     --with-linux-obj=.... argument for patchless client. When the
Yang Sheng's avatar
Yang Sheng committed
	     configure use --with-linux-obj, the LINUXINCLUDE= -Iinclude
	     can't search header adequately. Use absolute path such as 
	     -I($LINUX)/include instead. 

Severity   : normal
Bugzilla   : 13888
Description: interrupt oig_wait produce painc on resend.
Details    : brw_redo_request can be used for resend requests from ptlrpcd and
	     private set, and this produce situation when rq_ptlrpcd_data not
	     copyed to new allocated request and triggered LBUG on assert 
	     req->rq_ptlrpcd_data != NULL. But this member used only for wakeup
	     ptlrpcd set if request is changed and can be safety changed to use
	     rq_set directly.

Bobi Jam's avatar
Bobi Jam committed
Severity   : normal
Bugzilla   : 13497
Description: LASSERT_{REQ,REP}SWAB macros are buggy
Details    : If SWAB_PARANOIA is disabled, the LASSERT_REQSWAB and
Andreas Dilger's avatar
Andreas Dilger committed
	     LASSERT_REPSWAB macros become no-ops, which is incorrect. Drop
	     these macros and replace them with their difinitions instead.
Bobi Jam's avatar
Bobi Jam committed

Yang Sheng's avatar
Yang Sheng committed
Severity   : normal
Bugzilla   : 13521
Description: Update kernel patches for SLES10 2.6.16.53-0.8.
Details    : Update which_patch & target file for SLES10 latest kernel.

scjody's avatar
scjody committed
Bugzilla   : 12411
Description: Remove client patches from SLES 10 kernel.
Details    : This causes SLES 10 clients to behave as patchless clients
Andreas Dilger's avatar
Andreas Dilger committed
	     even on a Lustre-patched (server) kernel.
scjody's avatar
scjody committed

Yury Umanets's avatar
Yury Umanets committed
Severity   : enhancement
Bugzilla   : 2262
Description: self-adjustable client's lru lists
Details    : use adaptive algorithm for managing client cached locks lru 
Andreas Dilger's avatar
Andreas Dilger committed
	     lists according to current server load, other client's work
	     pattern, memory activities, etc. Both, server and client 
	     side namespaces provide number of proc tunables for controlling 
	     things
Yury Umanets's avatar
Yury Umanets committed

Eric Mei's avatar
Eric Mei committed
Severity   : enhancement
Bugzilla   : 13641
Description: light-weight GSS support
Details    : Support krb5n and krb5a mode, which keep Kerberos 5 authentication
	     and reduce performance overhead.

Severity   : enhancement
Bugzilla   : 11832
Description: Linux keyring support
Details    : Support using service of Linux keyring for Lustre GSS internal
	     context refresh/cache mechanism.

Cliff White's avatar
 
Cliff White committed
Severity   : normal
Bugzilla   : 12186
Description: Fix errors in lfs documentation
Details    : Fixes man pages

Bobi Jam's avatar
 
Bobi Jam committed
Severity   : normal
Bugzilla   : 12606
Description: don't use GFP_* in generic Lustre code.
Details    : Use cfs_alloc_* functions and CFS_* flags for code portability.

Liu Ying's avatar
Liu Ying committed
Severity   : normal
Bugzilla   : 12333
Description: obdclass is limited by single OBD_ALLOC(idarray)
Details    : replace OBD_ALLOC/OBD_FREE with OBD_VMALLOC/OBD_VFREE

Liu Ying's avatar
Liu Ying committed
Severity   : normal
Bugzilla   : 13006
Description: warnings with build patchless client with vanila 2.6.19 and up
Details    : change old ctl_table style and replace ctl_table/ctl_table_header
	     with cfs_sysctl_table_t/cfs_sysctl_table_header_t

Bobi Jam's avatar
Bobi Jam committed
Severity   : normal
Bugzilla   : 13177
Frequency  : Only for SLES
Description: sanity_quota fail test_1
Details    : There are multiple occurences of $TSTUSR in SLES's /etc/group
Andreas Dilger's avatar
Andreas Dilger committed
	     file, which makes TSTID[2] inunique.
Bobi Jam's avatar
Bobi Jam committed

Yang Sheng's avatar
Yang Sheng committed
Severity   : normal
Bugzilla   : 13249
Frequency  : Only for SLES9
Description: Kernel patches for SLES9 2.6.5-7.286 kernel
Details    : Update target/ChangeLog/which_patch .

tianzy's avatar
tianzy committed
Severity   : normal
Bugzilla   : 13170
Frequency  : Only for test_10 in sanity-quota.sh
Description: a bug in quota test code
Details    : checking a test flag in a wrong place

tianzy's avatar
tianzy committed
Severity   : normal
Bugzilla   : 13171
Frequency  : Only for test_13 in sanity-quota.sh
Description: a bug in quota test script
Details    : shouldn't check overfull allocated quota

kalpak's avatar
kalpak committed
Severity   : normal
Bugzilla   : 12955
Description: jbd statistics
Details    : Port older jbd statistics patch for sles10

Severity   : enhancement
Bugzilla   : 11721
Description: Add printing inode info into message about error in writepage.

Oleg Drokin's avatar
Oleg Drokin committed
Severity   : normal
Bugzilla   : 11974
Frequency  : Rare
Description: reply_lock_interpret crash due to race with it and lock cancel.
Details    : Do not replay locks that are being cancelled. Do not reference
Andreas Dilger's avatar
Andreas Dilger committed
	     locks by their address during replay, just by their handle.
Oleg Drokin's avatar
Oleg Drokin committed

Oleg Drokin's avatar
Oleg Drokin committed
Severity   : normal
Bugzilla   : 13103
Frequency  : When flocks are used.
Description: assertion failure in ldlm_cli_enquque_fini for non NULL lock.
Details    : Flock locks might destroy just granted lock if it could be merged
Andreas Dilger's avatar
Andreas Dilger committed
	     with another existing flock, this is done in completion handler,
Oleg Drokin's avatar
Oleg Drokin committed
	     so teach ldlm_cli_enquque_fini that this is a valid case for
	     flock locks.

Oleg Drokin's avatar
Oleg Drokin committed
Severity   : minor
Bugzilla   : 13276
Frequency  : rare
Description: Oops in read and write path when failing to allocate lock.
Details    : Check if lock allocation failed and return error back.

Liu Ying's avatar
Liu Ying committed
Severity   : normal 
Bugzilla   : 11679
Description: lstripe command fails for valid OST index
Details    : The stripe offset is compared to lov->desc.ld_tgt_count
	     instead of lov->desc.ld_active_tgt_count.

Bobi Jam's avatar
Bobi Jam committed
Severity   : normal
Bugzilla   : 12584
Description: sanity.sh failed test 103
Details    : RHEL mis-interpret setfacl "-X" param, so we won't test setfacl
Andreas Dilger's avatar
Andreas Dilger committed
	     with param "-X".
Bobi Jam's avatar
Bobi Jam committed

Bobi Jam's avatar
Bobi Jam committed
Severity   : normal
Bugzilla   : 12743
Description: df doesn't work properly if diskfs blocksize != 4K
Details    : Choose biggest blocksize of OST's as the LOV's blocksize.

Bobi Jam's avatar
Bobi Jam committed
Severity   : normal
Bugzilla   : 12836
Description: lfs find on -1 stripe looping in lsm_lmm_verify_common()
Details    : Avoid lov_verify_lmm_common() on directory with -1 stripe count.

Liu Ying's avatar
Liu Ying committed
Severity   : major
Bugzilla   : 12932
Description: obd_health_check_timeout too short
Details    : set obd_health_check_timeout as 1.5x of obd_timeout

Liu Ying's avatar
Liu Ying committed
Severity   : normal
Bugzilla   : 12192
Description: llapi_file_create() does not allow some changes
Details    : add llapi_file_open() that allows specifying the mode and 
	     open flags, and also returns an open file handle.

Yang Sheng's avatar
Yang Sheng committed
Severity   : normal
Bugzilla   : 11248
Description: merge and cleanup kernel patches.
Details    : Remove mnt_lustre_list in vfs_intent-2.6-rhel4.patch.

girish's avatar
girish committed
Severity   : normal
Bugzilla   : 10657
Description: Add journal checksum support.(Kernel part)
Details    : The journal checksum feature adds two new flags i.e 
Andreas Dilger's avatar
Andreas Dilger committed
	     JBD2_FEATURE_INCOMPAT_ASYNC_COMMIT and 
girish's avatar
girish committed
	     JBD2_FEATURE_COMPAT_CHECKSUM. JBD2_FEATURE_CHECKSUM flag 
	     indicates that the commit block contains the checksum for
	     the blocks described by the descriptor blocks. Now commit
	     record can be sent to disk without waiting for descriptor
	     blocks to be written to disk. This behavior is controlled
	     using JBD2_FEATURE_ASYNC_COMMIT flag.

Severity   : minor
Bugzilla   : 12446
Description: OSS needs mutliple precreate threads
Details    : Add ability to start more than one create thread per OSS.

Liu Ying's avatar
Liu Ying committed
Severity   : normal
Bugzilla   : 13362
Description: Sanity.sh test_65k failed: setstripe should have succeeded
Details    : execute lfs setstripe on client

tianzy's avatar
tianzy committed
Severity   : major
Bugzilla   : 12223
Description: mds_obd_create error creating tmp object
Details    : When the user sets quota on root, llog will be affected and can't 
Andreas Dilger's avatar
Andreas Dilger committed
	     create files and write files.
tianzy's avatar
tianzy committed

scjody's avatar
scjody committed
Severity   : normal
Frequency  : Always on ia64 patchless client, and possibly others.
Bugzilla   : 12826
Description: Add EXPORT_SYMBOL check for node_to_cpumask symbol.
Details    : This allows the patchless client to be loaded on architectures
Andreas Dilger's avatar
Andreas Dilger committed
	     without this export. 
scjody's avatar
scjody committed

Yang Sheng's avatar
Yang Sheng committed
Severity   : normal
Bugzilla   : 13039
Description: RedHat Update kernel for RHEL5
Details    : Add the kernel config file for RHEL5.

Yang Sheng's avatar
Yang Sheng committed
Severity   : normal
Bugzilla   : 13039
Description: RedHat Update kernel for RHEL5
Details    : Modify the kernel config file more closer RHEL5.

Yang Sheng's avatar
Yang Sheng committed
Severity   : normal
Bugzilla   : 13360
Description: Build failure against Centos5 (RHEL5)
Details    : Define PAGE_SIZE when it isn't present.

tianzy's avatar
tianzy committed
Severity   : minor
Bugzilla   : 13363
Description: test_7 on sanity-quota.sh can't be used on separate machines
Details    : add facet to handle it

Bobi Jam's avatar
Bobi Jam committed
Severity   : normal
Bugzilla   : 13030
Description: "ll_intent_file_open()) lock enqueue: err: -13" with nfs
Details    : with NFS, the anon dentry's parent was set to itself in
Andreas Dilger's avatar
Andreas Dilger committed
	     d_alloc_anon(), so in MDS, we use rec->ur_fid1 to find the
	     corresponding dentry other than use rec->ur_name.
Bobi Jam's avatar
Bobi Jam committed

Severity   : enhancement
Bugzilla   : 12786
Description: lfs setstripe enhancement
Details    : Make lfs setstripe understand 'k', 'm' and 'g' for stripe size.

Liu Ying's avatar
Liu Ying committed
Severity   : normal
Bugzilla   : 12398
Andreas Dilger's avatar
Andreas Dilger committed
Description: enable data checksumming by default
Details    : enable checksum by default, allow --disable-checksum
	     configure option and "-o nochecksum" mount option.  Checksums
	     can also be disabled at runtime via $LPROC/osc/*/checksum_pages.
Liu Ying's avatar
Liu Ying committed

Johann Lombardi's avatar
Johann Lombardi committed
Severity   : normal
Bugzilla   : 11802
Description: lustre support for RHEL5
Details    : Add support for RHEL5.

Johann Lombardi's avatar
Johann Lombardi committed
Severity   : normal
Bugzilla   : 12459
Description: Client eviction due to failover config
Details    : after a connection loss, the lustre client should attempt to
	     reconnect to the last active server first before trying the
	     other potential connections.

tianzy's avatar
tianzy committed
Severity   : minor
Bugzilla   : 12588
Description: when mds and osts use different quota unit(32bit and 64bit),
Andreas Dilger's avatar
Andreas Dilger committed
	     quota will be released repeatly.
Details    : void sending multiple quota reqs to mds, which will keep the
	     status between the reqs.
Liu Ying's avatar
Liu Ying committed

Severity   : normal 
Bugzilla   : 13125
Description: osts not allocated evenly to files
Details    : change the condition to increase offset_idx 

Andreas Dilger's avatar
Andreas Dilger committed
Severity   : critical
Frequency  : Always for filesystems larger than 2TB on 32-bit systems.
Bugzilla   : 13547
Description: Data corruption for OSTs that are formatted larger than 2TB
	     on 32-bit servers.
Details    : When generating the bio request for lustre file writes the
	     sector number would overflow a temporary variable before being
	     used for the IO.  The data reads correctly from Lustre (which
	     will overflow in a similar manner) but other file data or
	     filesystem metadata may be corrupted in some cases.

komaln's avatar
komaln committed
Severity   : normal
Bugzilla   : 11230
Description: Tune the kernel for good SCSI performance. 
Details    : Set the value of /sys/block/{dev}/queue/max_sectors_kb
Andreas Dilger's avatar
Andreas Dilger committed
	     to the value of /sys/block/{dev}/queue/max_hw_sectors_kb
	     in mount_lustre.
komaln's avatar
komaln committed

Severity   : cleanup
Bugzilla   : 13532
Description: rewrite ext2-derived code in obdclass/uuid.c
Details    : rewrite inherited code (uuid parsing code from ext2 utils)
	     from scratch preserving functionality.
Bobi Jam's avatar
Bobi Jam committed

Severity   : normal
Bugzilla   : 13600
Description: "lfs find -obd UUID" prints directories
Details    : "lfs find -obd UUID" will return all directory names instead
Andreas Dilger's avatar
Andreas Dilger committed
	     of just file names. It is incorrect because the directories
	     do not reside on the OSTs.
Bobi Jam's avatar
Bobi Jam committed

Liu Ying's avatar
Liu Ying committed
Severity   : minor
Bugzilla   : 2369
Description: use i_size_read and i_size_write in 2.6 port
Details    : replace inode->i_size access with i_size_read/write()

Johann Lombardi's avatar
Johann Lombardi committed
Severity   : normal
Frequency  : when removing large files
Bugzilla   : 13181
Description: scheduling issue during removal of large Lustre files
Details    : Don't take the BKL in fsfilt_ext3_setattr() for 2.6 kernels.
	     It causes scheduling issues when removing large files (17TB in the
	     present case).

Severity   : normal
Frequency  : only with liblustre clients on XT3
Bugzilla   : 12418
Description: evictions taking too long
Details    : allow llrd to evict clients directly on OSTs

bwzhou's avatar
bwzhou committed
Severity   : normal
Frequency  : only on ppc
Bugzilla   : 12234
Description: /proc/fs/lustre/devices broken on ppc
Details    : The patch as applied to 1.6.2 doesn't look correct for all arches.
	     We should make sure the type of 'index' is loff_t and then cast 
	     explicitly as needed below.  Do not assign an explicitly cast 
	     loff_t to an int.
Yang Sheng's avatar
Yang Sheng committed

Severity   : normal
Frequency  : only for rhel5
Bugzilla   : 13616
Description: Kernel patches update for RHEL5 2.6.18-8.1.10.el5.
Details    : Modify the target file & which_kernel.

Severity   : enhancement
Bugzilla   : 10786
Description: omit set fsid for export NFS
Details    : fix set/restore device id for avoid EMFILE error and mark lustre fs
	     as FS_REQUIRES_DEV for avoid problems with generate fsid.

Severity   : normal
Bugzilla   : 13304
Frequency  : Always, for kernels after 2.6.16
Andreas Dilger's avatar
Andreas Dilger committed
Description: Fix warning idr_remove called for id=.. which is not allocated.
Details    : Last kernels save old s_dev before kill super and not allow 
Andreas Dilger's avatar
Andreas Dilger committed
	     to restore from callback - restore it before call kill_anon_super.
Bobi Jam's avatar
Bobi Jam committed
Severity   : minor
Bugzilla   : 12948
Description: buffer overruns could theoretically occur
Details    : llapi_semantic_traverse() modifies the "path" argument by
Andreas Dilger's avatar
Andreas Dilger committed
	     appending values to the end of the origin string, and a 
	     overrun may occur. Adding buffer overrun check in liblustreapi.
Bobi Jam's avatar
Bobi Jam committed

Severity   : normal
scjody's avatar
scjody committed
Bugzilla   : 13334
Description: Fix error on 'ls .' at the top of the Lustre mount.
Details    : Don't revalidate dentry if it is a root dentry.
Yang Sheng's avatar
Yang Sheng committed
Severity   : normal
Bugzilla   : 13518
Description: Kernel patches update for RHEL4 2.6.9-55.0.6.
Details    : Modify vm-tunables-rhel4.patch.

Yang Sheng's avatar
Yang Sheng committed
Severity   : normal
Bugzilla   : 13452
Description: Kernel config for 2.6.18-vanilla.
Details    : Modify targets/2.6-vanilla.target.in.
Andreas Dilger's avatar
Andreas Dilger committed
	     Add config file kernel-2.6.18-2.6-vanilla-i686.config.
	     Add config file kernel-2.6.18-2.6-vanilla-i686-smp.config.
	     Add config file kernel-2.6.18-2.6-vanilla-x86_64.config.
	     Add config file kernel-2.6.18-2.6-vanilla-x86_64-smp.config.
Yang Sheng's avatar
Yang Sheng committed

Severity   : major
Bugzilla   : 11710
Description: improve handling recoverable errors
Details    : if request processig with error which can be recoverable on server
Andreas Dilger's avatar
Andreas Dilger committed
	     request should be resend, otherwise page released from cache and
	     marked as error.
Yang Sheng's avatar
Yang Sheng committed
Severity   : critical
Bugzilla   : 13751
Description: Kernel patches update for RHEL5 2.6.18-8.1.14.el5.
Details    : Modify target file & which_patch.
Andreas Dilger's avatar
Andreas Dilger committed
	     A flaw was found in the IA32 system call emulation provided 
	     on AMD64 and Intel 64 platforms. An improperly validated 64-bit 
	     value could be stored in the %RAX register, which could trigger an 
	     out-of-bounds system call table access. An untrusted local user 
	     could exploit this flaw to run code in the kernel 
	     (ie a root privilege escalation). (CVE-2007-4573). 
Yang Sheng's avatar
Yang Sheng committed

komaln's avatar
 
komaln committed
Severity   : major
Bugzilla   : 13093
Description: O_DIRECT bypasses client statistics.
Details    : When running with O_DIRECT I/O, neither the client rpc_stats nor
Andreas Dilger's avatar
Andreas Dilger committed
	     read_ahead_stats were updated. copied the stats section from
	     osc_send_oap_rpc() into async_internal().
komaln's avatar
 
komaln committed

kalpak's avatar
kalpak committed
Severity   : normal
Bugzilla   : 13454
Description: Add jbd statistics patch for RHEL5 and 2.6.18-vanilla

Liu Ying's avatar
Liu Ying committed
Severity   : minor
Bugzilla   : 13732 
Description: change order of libsysio includes
Details    : '#include sysio.h' should always come before '#include xtio.h'
Yang Sheng's avatar
Yang Sheng committed

kalpak's avatar
 
kalpak committed
Severity   : normal
Bugzilla   : 11673
Description: handle "serious error: objid * already exists" more gracefully
Details    : If LAST_ID value on disk is smaller than the objects existing in
Andreas Dilger's avatar
Andreas Dilger committed
	     the O/0/d* directories, it indicates disk corruption and causes an
	     LBUG(). If the object is 0-length, then we should use the existing
	     object. This will help to avoid a full fsck in most cases.
kalpak's avatar
 
kalpak committed

Jian Yu's avatar
Jian Yu committed
Severity   : enhancement
Bugzilla   : 13207
Description: adapt the lustre_config script to support the upgrade case
Details    : Add "-u" option for lustre_config script to support upgrading 1.4
Andreas Dilger's avatar
Andreas Dilger committed
	     server targets to 1.6 in parallel.
Jian Yu's avatar
Jian Yu committed

Wang Di's avatar
Wang Di committed
Severity   : normal
Bugzilla   : 13570
Wang Di's avatar
Wang Di committed
Description: To avoid grant space > avaible space when the disk is almost
Andreas Dilger's avatar
Andreas Dilger committed
	     full. Without this patch you might see the error "grant XXXX >
	     available" or some LBUG about grant, when the disk is almost 
	     full.
Wang Di's avatar
Wang Di committed
Details    : In filter_check_grant, for non_grant cache write, we should
Andreas Dilger's avatar
Andreas Dilger committed
	     check the left space by  if (*left > ungranted + bytes), instead
	     of (*left > ungranted), because only we are sure the left space 
	     is enough for another "bytes", then the ungrant space should be
	     increase. In client, we should update cl_avail_grant only there
	     is OBD_MD_FLGRANT in the reply.
Wang Di's avatar
Wang Di committed

Yang Sheng's avatar
Yang Sheng committed
Severity   : critical
Bugzilla   : 13748
Description: Update RHEL 4 kernel to fix local root privilege escalation.
Details    : Update to the latest RHEL 4 kernel to fix the vulnerability
	     described in CVE-2007-4573.  This problem could allow untrusted
	     local users to gain root access.

Johann Lombardi's avatar
Johann Lombardi committed
Severity   : normal
Frequency  : when using O_DIRECT and quotas
Bugzilla   : 13930
Description: Incorrect file ownership on O_DIRECT output files
Details    : block usage reported by 'lfs quota' does not take into account
	     files that have been written with O_DIRECT.
Severity   : normal
Frequency  : always
Bugzilla   : 13976
Description: touch file failed when fs is not full
Details    : OST in recovery should not be discarded by MDS in alloc_qos(),
Andreas Dilger's avatar
Andreas Dilger committed
	     otherwise we can get ENOSP while fs is not full.
Severity   : normal
Bugzilla   : 11301
Description: parallel lock callbacks
Details    : Instead of sending blocking and completion callbacks as separated
Andreas Dilger's avatar
Andreas Dilger committed
	     requests, adding them to a set and sending in parallel.
Johann Lombardi's avatar
Johann Lombardi committed
Severity   : normal
Frequency  : only for Cray XT3
Bugzilla   : 12829/13455
Description: Changing primary group doesn't change the group lustre assigns to
	     a file
Details    : When CRAY_XT3 is defined, the fsgid supplied by the client is
	     overridden with the primary group provided by the group upcall,
	     whereas the supplied fsgid can be trusted if it is in the list of
	     supplementary groups returned by the group upcall.

Johann Lombardi's avatar
Johann Lombardi committed
Severity   : enhancement
Bugzilla   : 14398
Description: Allow masking D_WARNING, D_ERROR messages from console
Details    : Console messages can now be disabled via lnet.printk.

Bobi Jam's avatar
 
Bobi Jam committed
--------------------------------------------------------------------------------

scjody's avatar
scjody committed
2007-08-10         Cluster File Systems, Inc. <info@clusterfs.com>
       * version 1.6.1
       * Support for kernels:
scjody's avatar
scjody committed
        2.6.5-7.283 (SLES 9),
        2.6.9-55.EL (RHEL 4),
        2.6.16.46-0.14 (SLES 10),
scjody's avatar
scjody committed
        2.6.18.8 vanilla (kernel.org)
       * Client support for unpatched kernels:
        (see http://wiki.lustre.org/index.php?title=Patchless_Client)
        2.6.16 - 2.6.22 vanilla (kernel.org)
       * Due to recently discovered recovery problems, we do not recommend
        using patchless RHEL 4 clients with this or any earlier release.
       * Recommended e2fsprogs version: 1.39.cfs8
Andreas Dilger's avatar
Andreas Dilger committed
       * Note that reiserfs quotas are disabled on SLES 10 in this kernel.
scjody's avatar
scjody committed
       * Starting with this release, the ldiskfs backing filesystem required
        by Lustre is now in its own package, lustre-ldiskfs.  This package
        should be installed.  It is versioned separately from Lustre and
        may be released separately in future.
Cliff White's avatar
 
Cliff White committed

Nathan Rutman's avatar
Nathan Rutman committed
Severity   : minor
Bugzilla   : 13147
Description: block reactivating mgc import until all deactivates complete
Details    : Fix race when failing back MDT/MGS to itself (testing)

Nathan Rutman's avatar
Nathan Rutman committed
Severity   : minor
Frequency  : at statup only
Bugzilla   : 12860
Description: mds_lov_synchronize race leads to various problems
Details    : simultaneous MDT->OST connections at startup can cause the 
	     sync to abort, leaving the OSC in a bad state.

Severity   : enhancement
Bugzilla   : 12194
Description: add optional extra BUILD_VERSION info
Details    : add a new environment variable (namely LUSTRE_VERS) which allows
	     to override the lustre version.

Severity   : normal
Frequency  : 2.6.18 servers only
Bugzilla   : 12546
Description: ll_kern_mount() doesn't release the module reference
Details    : The ldiskfs module reference count never drops down to 0
	     because ll_kern_mount() doesn't release the module reference.

Severity   : normal
Frequency  : rare
Bugzilla   : 12470
Description: server LBUG when using old ost_num_threads parameter
Details    : Accept the old ost_num_threads parameter but warn that it
	     is deprecated, and fix an off-by-one error that caused an LBUG.

Severity   : normal
Frequency  : rare
Bugzilla   : 11722
Description: Transient SCSI error results in persistent IO issue
Details    : iobuf->dr_error is not reinitialized to 0 between two
	     uses.

Severity   : normal
Frequency  : sometimes when underlying device returns I/O errors
Bugzilla   : 11743
Description: OSTs not going read-only during write failures
Details    : OSTs are not remounted read-only when the journal commit threads
	     get I/O errors because fsfilt_ext3 calls journal_start/stop()
	     instead of the ext3 wrappers.

Severity   : minor
Bugzilla   : 12364
Description: poor connect scaling with increasing client count
Details    : Don't run filter_grant_sanity_check for more than 100 exports
	     to improve scaling for large numbers of clients.

Severity   : normal
Frequency  : SLES10 only
Bugzilla   : 12538
Description: sanity-quota.sh quotacheck failed: rc = -22
Details    : Quotas cannot be enabled on SLES10.

Johann Lombardi's avatar
Johann Lombardi committed
Severity   : normal
Frequency  : liblustre clients only
Bugzilla   : 12229
Description: getdirentries does not give error when run on compute nodes
Details    : getdirentries does not fail when the size specified as an argument
	     is too small to contain at least one entry

scjody's avatar
scjody committed
Severity   : enhancement
Bugzilla   : 11548
Description: Add LNET router traceability for debug purposes
Details    : If a checksum failure occurs with a router as part of the
Andreas Dilger's avatar
Andreas Dilger committed
	     IO path, the NID of the last router that forwarded the bulk data
scjody's avatar
scjody committed
	     is printed so it can be identified.

scjody's avatar
scjody committed
Severity   : normal
Frequency  : rare
Bugzilla   : 11315
Description: OST "spontaneously" evicts client; client has imp_pingable == 0
Details    : Due to a race condition, liblustre clients were occasionally
Andreas Dilger's avatar
Andreas Dilger committed
	     evicted incorrectly.
Andreas Dilger's avatar
Andreas Dilger committed

Severity   : enhancement
Bugzilla   : 10997
Andreas Dilger's avatar
Andreas Dilger committed
Description: lfs setstripe use optional parameters instead of postional
Andreas Dilger's avatar
Andreas Dilger committed
	     parameters.

Andreas Dilger's avatar
Andreas Dilger committed
Severity   : enhancement
kalpak's avatar
kalpak committed
Bugzilla   : 10651
Andreas Dilger's avatar
Andreas Dilger committed
Description: Nanosecond timestamp support for ldiskfs
Details    : The on-disk ldiskfs filesystem has added support for nanosecond
	     resolution timestamps.  There is not yet support for this at
	     the Lustre filesystem level.
kalpak's avatar
kalpak committed

Severity   : normal
Frequency  : during server recovery
Bugzilla   : 11203
Description: MDS failing to send precreate requests due to OSCC_FLAG_RECOVERING
Details    : request with rq_no_resend flag not awake l_wait_event if they get a
Andreas Dilger's avatar
Andreas Dilger committed
	     timeout.
Severity   : minor
Frequency  : nfs export on patchless client
Bugzilla   : 11970
Description: connectathon hang when test nfs export over patchless client
Nathan Rutman's avatar
Nathan Rutman committed
Details    : Disconnected dentry cannot be found with lookup, so we do not need
Andreas Dilger's avatar
Andreas Dilger committed
	     to unhash it or make it invalid
kalpak's avatar
kalpak committed
Bugzilla   : 11757
Description: fix llapi_lov_get_uuids() to allow many OSTs to be returned
Details:   : Change llapi_lov_get_uuids() to read the UUIDs from /proc instead
	     of using an ioctl. This allows lfsck for > 160 OSTs to succeed.

Oleg Drokin's avatar
Oleg Drokin committed
Severity   : minor
Frequency  : rare
Bugzilla   : 11546
Description: open req refcounting wrong on reconnect
Details    : If reconnect happened between getting open reply from server and
	     call to mdc_set_replay_data in ll_file_open, we will schedule
	     replay for unreferenced request that we are about to free.
	     Subsequent close will crash in variety of ways.
	     Check that request is still eligible for replay in
	     mdc_set_replay_data().

Andreas Dilger's avatar
Andreas Dilger committed
Severity   : minor
Frequency  : rare
Bugzilla   : 11512
Description: disable writes to filesystem when reading health_check file
Details    : the default for reading the health_check proc file has changed
	     to NOT do a journal transaction and write to disk, because this
	     can cause reads of the /proc file to hang and block HA state
	     checking on a healthy but otherwise heavily loaded system.  It
	     is possible to return to the previous behaviour during configure
	     with --enable-health-write.

Severity   : enhancement
Bugzilla   : 10768
Description: 64-bit inode version
Details:   : Add a on-disk 64-bit inode version for ext3 to track changes made
	     to the inode. This will be required for version-based recovery.

Severity   : normal
Frequency  : rare
Bugzilla   : 11818
Description: MDS fails to start if a duplicate client export is detected
Details    : in some rare cases it was possible for a client to connect to
	     an MDS multiple times.  Upon recovery the MDS would detect this
	     and fail during startup.  Handle this more gracefully.

Severity   : enhancement
Bugzilla   : 11563
Description: Add -o localflock option to simulate  old noflock
behaviour.
Details    : This will achieve local-only flock/fcntl locks
	     coherentness.

Severity   : minor
Frequency  : rare
Bugzilla   : 11658
Description: log_commit_thread vs filter_destroy race leads to crash
Details    : Take import reference before releasing llog record semaphore

Severity   : normal
Frequency  : rare
Bugzilla   : 12477
Description: Wrong request locking in request set processing
Details    : ptlrpc_check_set wrongly uses req->rq_lock for proctect add to
	     imp_delayed_list, in this place should be used imp_lock.

Severity   : normal
Frequency  : when reconnection
Bugzilla   : 11662
Description: Grant Leak when osc reconnect to OST
Details    : When osc reconnect ost, OST(filter) should check whether it
	     should grant more space to client by comparing fed_grant and
	     cl_avail_grant, and return the granted space to client instead
	     of "new granted" space, because client will call osc_init_grant
	     to update the client grant space info.

Severity   : normal
Frequency  : when client reconnect to OST 
Bugzilla   : 11662
Description: Grant Leak when osc do resend and replay bulk write
Details	   : When osc reconnect to OST, OST(filter)should clear grant info of
	     bulk write request, because the grant info while be sync between
	     OSC and OST when reconnect, and we should ignore the grant info
	     these of resend/replay write req.

Severity   : normal
Frequency  : rare 
Bugzilla   : 11662
Description: Grant space more than avaiable left space sometimes.
Details	   : When then OST is about to be full, if two bulk writing from
	     different clients came to OST. Accord the avaliable space of the
	     OST, the first req should be permitted, and the second one
	     should be denied by ENOSPC. But if the seconde arrived before 
	     the first one is commited. The OST might wrongly permit second
	     writing, which will cause grant space > avaiable space.

Severity   : normal
Frequency  : when client is evicted 
Bugzilla   : 12371
Description: Grant might be wrongly erased when osc is evicted by OST
Details    : when the import is evicted by server, it will fork another 
	     thread ptlrpc_invalidate_import_thread to invalidate the 
	     import, where the grant will be set to 0.  While the original 
	     thread will update the grant it got when connecting. So if 
	     the former happened latter, the grant will be wrongly errased 
	     because of this race.

Severity   : normal
Frequency  : rare 
Bugzilla   : 12401
Description: Checking Stale with correct fid 
Details    : ll_revalidate_it should uses de_inode instead of op_data.fid2 
	     to check whether it is stale, because sometimes, we want the 
	     enqueue happened anyway, and op_data.fid2 will not be initialized.

Severity   : enhancement
Bugzilla   : 11647
Description: update patchless client
Details    : Add support for patchless client with 2.6.20, 2.6.21 and RHEL 5

Severity   : normal
Frequency  : only with 2.4 kernel
Bugzilla   : 12134
Description: random memory corruption
Details    : size of struct ll_inode_info is to big for union inode.u and this
	     can be cause of random memory corruption.

Severity   : normal
Frequency  : rare 
Bugzilla   : 10818
Description: Memory leak in recovery
Details    : Lov_mds_md was not free in an error handler in mds_create_object.
	     It should also check obd_fail before fsfilt_start, otherwise if 
	     fsfilt_start return -EROFS,(failover mds during mds recovery).
	     then the req will return with repmsg->transno = 0 and rc = EROFS. 
	     and we met hit the assert LASSERT(req->rq_reqmsg->transno ==
	     req->rq_repmsg->transno) in ptlrpc_replay_interpret.  Fcc should 
	     be freed no matter whether fsfilt_commit success or not.

Severity   : minor
Frequency  : only with huge count clients
Bugzilla   : 11817
Description: Prevents from taking the superblock lock in llap_from_page for 
	     a soon died page.
Details    : using LL_ORIGIN_REMOVEPAGE origin flag instead of LL_ORIGIN_UNKNOW 
	     for llap_from_page call in ll_removepage prevents from taking the 
	     superblock lock for a soon died page.

Severity   : normal
Frequency  : rare
Bugzilla   : 11935 
Description: Not check open intent error before release open handle
Details    : in some rare cases, the open intent error is not checked before
	     release open handle, which may cause
	     ASSERTION(open_req->rq_transno != 0), because it tries to release
	     the failed open handle.

Severity   : normal
Frequency  : rare
Bugzilla   : 12556 
Description: Set cat log bitmap only after create log success. 
Details    : in some rare cases, the cat log bitmap is set too early. and it 
	     should be set only after create log success.

Severity   : major
Bugzilla   : 11971
Description: Accessing a block bevice can re-enable I/O when Lustre is
	     tearing down a device.
Details    : dev_clear_rdonly(bdev) must be called in kill_bdev() instead of
	     blkdev_put().

Severity   : minor
Bugzilla   : 11706
Description: service threads may hog cpus when there are a lot of requests
Andreas Dilger's avatar
Andreas Dilger committed
Details    : Insert cond_resched to give other threads a chance to use some CPU

Severity   : normal
Frequency  : rare
Bugzilla   : 12086 
Description: the cat log was not initialized in recovery 
Details    : When mds(mgs) do recovery, the tgt_count might be zero, so the
	     unlink log on mds will not be initialized until mds post
	     recovery. And also in mds post recovery, the unlink log will 
	     initialization will be done asynchronausly, so there will be race
	     between add unlink log and unlink log initialization.

Severity   : normal
Bugzilla   : 12597
Description: brw_stats were being printed incorrectly
Details    : brw_stats were being printed as log2 but all of them were not
Andreas Dilger's avatar
Andreas Dilger committed
	     recorded as log2. Also remove some code duplication arising from
	     filter_tally_{read,write}.

Severity   : normal
Bugzilla   : 11674
Frequency  : rare, only in recovery.
Description: ASSERTION(req->rq_type != LI_POISON) failed
Details    : imp_lock should be held while iterating over imp_sending_list for
Andreas Dilger's avatar
Andreas Dilger committed
	     prevent destroy request after get timeout in ptlrpc_queue_wait.

Severity   : normal
Bugzilla   : 12689
Description: replay-single.sh test 52 fails
Details    : A lock's skiplist need to be cleanup when it being unlinked 
Andreas Dilger's avatar
Andreas Dilger committed
	     from its resource list.

Severity   : normal
Bugzilla   : 11737
Description: Short directio read returns full requested size rather than
Andreas Dilger's avatar
Andreas Dilger committed
	     actual amount read.
Details    : Direct I/O operations should return actual amount of bytes
Andreas Dilger's avatar
Andreas Dilger committed
	     transferred rather than requested size.
Vitaly Fertman's avatar
Vitaly Fertman committed
Severity   : enhancement
Bugzilla   : 10589
Description: metadata RPC reduction (e.g. for rm performance)
Details    : decrease the amount of synchronous RPC between clients and servers
Andreas Dilger's avatar
Andreas Dilger committed
	     by canceling conflicing lock before the operation on the client
	     and packing thier handles into the main operation RPC to server.
Vitaly Fertman's avatar
Vitaly Fertman committed

Severity   : enhancement
Bugzilla   : 4900
Description: Async OSC create to avoid the blocking unnecessarily.
Details    : If a OST has no remain object, system will block on the creating
Andreas Dilger's avatar
Andreas Dilger committed
	    when need to create a new object on this OST. Now, ways use
	    pre-created objects when available, instead of blocking on an
	    empty osc while others are not empty.  If we must block, we block
	    for the shortest possible period of time.

Severity   : enhancement
Bugzilla   : 12702
Description: refine locking for avoid write wrong info into lov_objid file
Details    : fix possible races with add new target and write/update data in 
	     lov_objid file.
Nathan Rutman's avatar
Nathan Rutman committed
--------------------------------------------------------------------------------
scjody's avatar
scjody committed

Nathan Rutman's avatar
Nathan Rutman committed
2007-05-03  Cluster File Systems, Inc. <info@clusterfs.com>
       * version 1.6.0.1
       * bug fixes

Severity   : normal
Andreas Dilger's avatar
Andreas Dilger committed
Frequency  : on some architectures
Nathan Rutman's avatar
Nathan Rutman committed
Bugzilla   : 12404
Description: 1.6 client sometimes fails to mount from a 1.4 MDT
Details    : Uninitialized flags sometimes cause configuration commands to
	     be skipped.
Andreas Dilger's avatar
Andreas Dilger committed

Severity   : normal
Frequency  : patchless clients only
Bugzilla   : 12391
Description: missing __iget() symbol export
Details    : The __iget() symbol export is missing.  To avoid the need for
	     this on patchless clients the deathrow inode reaper is turned
	     off, and we depend on the VM to clean up old inodes.  This
	     dependency was during via the fix for bug 12181.
Andreas Dilger's avatar
Andreas Dilger committed
	
Andreas Dilger's avatar
Andreas Dilger committed
--------------------------------------------------------------------------------

scjody's avatar
 
scjody committed
2007-04-19  Cluster File Systems, Inc. <info@clusterfs.com>
Nathan Rutman's avatar
Nathan Rutman committed
       * version 1.6.0
       * CONFIGURATION CHANGE.  This version of Lustre WILL NOT
	 INTEROPERATE with older versions automatically.  In many cases a
	 special upgrade step is needed. Please read the
	 user documentation before upgrading any part of a live system.
       * WIRE PROTOCOL CHANGE from previous 1.6 beta versions.  This
         version will not interoperate with 1.6 betas before beta5 (1.5.95).  
Nathan Rutman's avatar
Nathan Rutman committed
       * WARNING: Lustre configuration and startup changes are required with
	 this release.  See https://mail.clusterfs.com/wikis/lustre/MountConf
	 for details.
Yury Umanets's avatar
Yury Umanets committed
       * bug fixes

Nathan Rutman's avatar
Nathan Rutman committed
Severity   : enhancement
Bugzilla   : 8007
Description: MountConf
Details    : Lustre configuration is now managed via mkfs and mount
	     commands instead of lmc and lconf.  New obd types (MGS, MGC)
	     are added for dynamic configuration management.  See
	     https://mail.clusterfs.com/wikis/lustre/MountConf for
	     details.

Severity   : enhancement
Bugzilla   : 4482
Description: dynamic OST addition
Details    : OSTs can now be added to a live filesystem

Severity   : enhancement
Bugzilla   : 9851
Description: startup order invariance
Details    : MDTs and OSTs can be started in any order.  Clients only
	     require the MDT to complete startup.
Nathan Rutman's avatar
Nathan Rutman committed
Severity   : enhancement
Bugzilla   : 4899