- Dec 03, 2003
-
-
Mike Shaver authored
is interrupted on the client). r=robert.
-
Phil Schwan authored
Andreas's patch to fix MDS lock inversions in getattr/reint paths. I'm giving it one more day to bake on ALC before I commit to the 1.0.x branch.
-
Phil Schwan authored
-
Phil Schwan authored
-
- Sep 12, 2003
-
-
Andreas Dilger authored
have cascading failures. Server bulk timeout is 1/4 of the client timeout. Also fix /proc variables to be int, as that is what the functions expect. b=1845
-
Andreas Dilger authored
b=1765
-
Andreas Dilger authored
b=1600
-
Andreas Dilger authored
-
- Aug 12, 2003
-
-
Andreas Dilger authored
b=1749 r=phil
-
- Aug 08, 2003
-
-
Andreas Dilger authored
-
- Jul 25, 2003
-
-
Phil Schwan authored
- dozens and dozens of fixes for working with 2.6 kernels - new 2.4 kernel APIs - uncountable bug fixes
-
- Jul 03, 2003
-
-
Phil Schwan authored
-
- Jun 26, 2003
-
-
Phil Schwan authored
-
- Jun 12, 2003
-
-
Phil Schwan authored
- remove extN/ files
-
- Mar 02, 2003
-
-
Phil Schwan authored
-
- Feb 11, 2003
-
-
Phil Schwan authored
Call LDLM_DEBUG0 instead of LDLM_DEBUG when there are no arguments. This is a temporary workaround for gcc 3.2 that I really hate.
-
- Feb 07, 2003
-
-
Phil Schwan authored
* bug fixes - Fix ldlm_lock_match on the MDS to avoid matching remote locks (592) - Fix fsfilt_extN_readpage() to read a full page of directory entries, or fake the remainder if PAGE_SIZE != blocksize (500) - Avoid extra mdc_getattr() in ll_intent_lock when possible (534, 604) - Fix imbalanced LOV object allocation and out-of-bound access (469) - Most intent operations were removed, in favour of a new RPC mode that does a single RPC to the server and bypasses most of the VFS - All LDLM resource ID arrays were removed in favour of ldlm_res_id - Aggressively cancel local locks on DLM servers - mds_reint_unlink sends EA to the client if it's the last nlink. client uses that EA to unlink OST objects. - mds_reint_{rename,unlink,link} were rewritten to take ordered locks - recursive symlinks were fixed (440) - fixed NULL deref in DEBUG_REQ - filter_update_lastobjid no longer calls sync, which annoyed extN - fixed multi-client small-writes to a single file problem (445) - fixed mtime updates during file writes (607) - fixed vector writes on obdfilter causing problems when ENOSPC (670) - fixed bug in obd_brw_read/write() (under guise of testing 367) - fixed Linux OST size reporting problem (444, 656) - OST now updates object mtime with writes or setattr (607, 619) - client verifies file size before zeroing page past EOF (445) - OST now writes last allocated objid to disk with allocation (108) - LOV on echo now works (409) * protocol changes - mds_reint_unlink sends a new buffer, with the EA included. this buffer is only valid if body->valid & OBD_MD_FLEASIZE, which is only set if a regular file was being unlinked, and it was the last link - use PtlGet from the target for bulk writes (315) - OST now updates object mtime with writes or setattr (607, 619) - LDLM now has a grant-time callback to revalidate locked items, if necessary (604) - Many MDS operations were reorganized to combat race conditions * other changes - Merge b_intel branch (updated lprocfs code) - now at /proc/fs/lustre - configure check to avoid gcc version 2.96 20000731-2.96-98 (606)
-
- Jan 06, 2003
-
-
Andreas Dilger authored
Fixes a _huge_ number of bugs: - Fully reactivate OST imports after reconnection (512, others) - Make sure client sees our -ENOTCONN from mds_handle (513 - partial) - More graceful error handling for truncating on dead OST (515) - Don't error out unless we're actually accessing dead stripes (474) - Fix garbage sizes when stripes are missing (410) - LRU counters were broken, causing constant lock purge (433, 432) - garbage on read from stripes with failed OSTs (441) - mark OSCs as active before reconnecting during recovery (438) - lov_enqueue and lov_cancel need to handle inactive OSTs (403) - lfind did not preserve OST order in output (443) - symlinks cause hung clients, incorrect data (439) - stop dereferencing request after dropping refcount (457) - don't LASSERT(spin_is_locked) on non-SMP (455) - fixes for many rename() bugs - fstat didn't correctly synchronize attributes (399) - server must handle lock cancellation during blocking AST prep (487) - bulk descriptors were free()d too soon (511) - fix paths in lconf, which would load incorrect modules (451, 507) - fix confusing lconf 'host not found' error message (386) - fix lock order deadlock on OST (O/R i_sem before journal ops, 478) - fix race condition in mdc_blocking_ast() for inode access (526) - fix lov_unpackmd() unpacking wrong number of stripes (537) - fix lov_set_osc_active() marking wrong OSC inactive (440) - fix bad lstripe lov_unpackmd() assertion (fix layering too) (527) - fix multiple writes of stripe MD to MDS (358, maybe 519) - fix lstripe in several ways (kernel side) (527) - fix request leak in ldlm_cli_enqueue (262) - incorrect OSC was marked inactive after OST failure - call mds_fs_cleanup before unmounting filesystem (524) - fix races between taking ns_lock and ldlm_lock_change_resource - fix races updating LOV export open file list - fix lov_enqueue error path, avoid decref-ing bad lock handle - fix recovery NULL deref in ldlm_cli_cancel_unused - fix some DLM races by using new hash table for lock handles (419) - permit the client to specify desired inodes, at replay - duplicate requests when we queue them for replay reintegration - fix last_rcvd offset calculation - sync after each recovered transaction, so we always make progress - never, not always, ERESTART requests without transnos - store the lov_desc in the MDS, so we don't depend on getlovinfo to set it - skip replay if the MDS says that the client is already connected - don't check for a recovery-enabled export to match lctl's UUID - don't INC_USE_COUNT for phantom exports - don't crash when cleaning up phantom exports (567) - don't double-finish or set replay data for errored mdc_open requests - abort requests when they time out, so we don't get old replies - send/receive replies for AST messages again - if the client says that it doesn't have the lock, cancel it on the server - if we timeout during I/O, don't try to cancel an in-use lock; instead mark it as destroyed, it will all work out when decref is called - fix module use counts (22, 581) * protocol changes - ASTs now expect a reply (server cancels lock on error reply)
-
- Dec 17, 2002
-
-
Phil Schwan authored
land b_md onto HEAD. almost entirely small cleanups and miscellaneous fixes, plus: - fix lock order deadlock on OST (O/R i_sem before journal ops, 478) - fix follow_symlink bugs (437) - preparation for some ldlm race fixes (419)
-
- Dec 16, 2002
-
-
Mike Shaver authored
- b=512,others: fully reactivate OST imports after reconnection. - b=513(partial): make sure client sees out -ENOTCONN from mds_handle - b=515: more graceful error handling for truncating on dead OST - b=474: don't error out just because a file is striped across a dead OST; only if we actually try to access it - b=410: fix garbage sizes when stat(2)ing a file that includes a stripe on a dead OST - print console diagnostic for completion-timeout client recovery - MSG_RESENT for requests that were retransmitted
-
- Dec 14, 2002
-
-
Phil Schwan authored
- fstat didn't correctly synchronize attributes (399) - server must handle lock cancellation during blocking AST prep (487) - bulk descriptors were free()d too soon (511) - fix paths in lconf, which would load incorrect modules (451, 507) - fix confusing lconf 'host not found' error message (386)
-
- Dec 06, 2002
-
-
Phil Schwan authored
- lfind did not preserve OST order in output (443) - symlinks cause hung clients, incorrect data (439) - stop dereferencing request after dropping refcount (457) - don't LASSERT(spin_is_locked) on non-SMP (455) - fixes for many rename() bugs
-
- Dec 05, 2002
-
-
Phil Schwan authored
- LRU counters were broken, causing constant lock purge (433, 432) - the first part of the fixes for the ldlm_handle2lock race (419) - cleanup unused ll_inode_info field (442) - keep ioctl semaphore from serializing test_getattr/brw
-
- Dec 02, 2002
-
-
Andreas Dilger authored
- fixes of many metadata bugs: - multiple client file opens - missing/extra intent releases - symlink fixes - OST/MDS server threads clobbering each other - dbench 2 works mostly - many others too numerous to mention here (ChangeLog is a good record)
-
- Nov 28, 2002
-
-
Mike Shaver authored
Highlights: - b=324: MDS recovery must replay transactions in strict transno sequence - b=325: getattr after OST failure returns -EIO - b=326: unlink after OST failure returns -EIO - b=400: new client can't join cluster after OST failure - b=403: multi-client access failure when OST fails - b=410: After an OST failure, lfind incorrectly displays file information - b=417: Freeing unreplayable requests twice (aed's fix from b_md) - b=402: (partial) give error for lstripe request that exceeds configured OSTs - much better support for reconnecting to MDS after network partition (still some lock-repeating issues to be resolved for some requests) - better support for connecting to multiple MDSes on one host (xid and transno and request_list are all per-import now) - track disconnecting clients in last_rcvd, for more reliable recovery - also, sync last_rcvd after connect/disconnect - reduced syslog/CERROR output for recovery (hi, Terry!) - server (DLM) timeout is half the system-wide timeout, to avoid cascading failure in the face of a dead client - don't wait for recovery to finish in order to send disconnect messages - removal of c_dying_head - don't wait for timeout to trigger recovery after ptl_send_rpc error - strict MDS transno ordering via mds_transno_sem (non-optimal, but correct) - many !handle -> IS_ERR(handle) fixes around mds_fs_start callers. - turn on client-eviction for bulk-timeouts in OST and MDS
-
- Nov 25, 2002
-
-
Phil Schwan authored
- fix dentry->d_it clobbering - credentials weren't being shipped for readdir/getattr operations - remove invalid assertions triggered during some concurrent MD updates - many, many updates to llanalyse
-
- Nov 23, 2002
-
-
Andreas Dilger authored
We just ignore exports with NULL mcd, since they are just being set up. The clients have a semaphore to prevent multiple setups there. We could alternately have a semaphore on the MDS instead of the current spinlock.
-
- Nov 21, 2002
-
-
Andreas Dilger authored
user trying to create a file and getting -EACCES from lookup_one_len().
-
- Nov 18, 2002
-
-
Phil Schwan authored
b=214 b=306 b=385 landing b_brw, which _includes_ all fixes from b_lov. Roughly, this fixes bugs in 3 general areas: - OSC was locking one byte too many; fixed. - several LOV bug fixes, to support adding LOVs and striping over fewer than the maximum number - the brw code was rewritten See the bugs listed above for more details. This will be immediately tagged as 0.5.17.
-
- Nov 13, 2002
-
-
thantry authored
-
thantry authored
to insert two devices with the same ID in a router configuration. 2. Added LDLM counters into /proc/lustre hierarchy 3. Fixes for Andreas's comments in Bug #357. 4. Bug #332 fixed, num_devices replaced with num_refs 5. Removed file proc_lustre.c, which was dead code.
-
- Nov 10, 2002
-
-
Peter Braam authored
in case of UML to aid debugging.
-
- Nov 08, 2002
-
-
Andreas Dilger authored
- reserve highmem pages before we start doing kmaps to avoid deadlocks with multiple threads reserving vectors of pages
-
- Nov 07, 2002
-
-
Andreas Dilger authored
for extN (1 block for each inode, excluding special files).
-
Mike Shaver authored
operations, or intent-only metadata loads will result in unbounded client memory growth.
-
Peter Braam authored
- LL_SAVE_INTENT fixes
-
- Nov 05, 2002
-
-
Phil Schwan authored
- Removes the second LDLM import that Peter added, in favour of doing a last-minute fixup of the portal numbers in ldlm_cli_cancel
-
- Nov 02, 2002
-
-
Mike Shaver authored
generations numbers at recreate-time. Single-client recovery should now be _that_much_ more robust. (Bug 299.)
-
thantry authored
2. Removed typedef 3. lproc-snmp back in.
-
Peter Braam authored
- separate cancel/callback threads to avoid starvation - fix for broken symlinks
-