Skip to content
Snippets Groups Projects
  1. Dec 03, 2003
  2. Sep 12, 2003
  3. Aug 12, 2003
  4. Aug 08, 2003
  5. Jul 25, 2003
  6. Jul 03, 2003
  7. Jun 26, 2003
  8. Jun 12, 2003
  9. Mar 02, 2003
  10. Feb 11, 2003
    • Phil Schwan's avatar
      b=850 · 41f1b17a
      Phil Schwan authored
      Call LDLM_DEBUG0 instead of LDLM_DEBUG when there are no arguments.
      This is a temporary workaround for gcc 3.2 that I really hate.
      41f1b17a
  11. Feb 07, 2003
    • Phil Schwan's avatar
      Merge b_md into HEAD · 93acd158
      Phil Schwan authored
      * bug fixes
       - Fix ldlm_lock_match on the MDS to avoid matching remote locks (592)
       - Fix fsfilt_extN_readpage() to read a full page of directory
         entries, or fake the remainder if PAGE_SIZE != blocksize (500)
       - Avoid extra mdc_getattr() in ll_intent_lock when possible (534, 604)
       - Fix imbalanced LOV object allocation and out-of-bound access (469)
       - Most intent operations were removed, in favour of a new RPC mode
         that does a single RPC to the server and bypasses most of the VFS
       - All LDLM resource ID arrays were removed in favour of ldlm_res_id
       - Aggressively cancel local locks on DLM servers
       - mds_reint_unlink sends EA to the client if it's the last nlink.
         client uses that EA to unlink OST objects.
       - mds_reint_{rename,unlink,link} were rewritten to take ordered locks
       - recursive symlinks were fixed (440)
       - fixed NULL deref in DEBUG_REQ
       - filter_update_lastobjid no longer calls sync, which annoyed extN
       - fixed multi-client small-writes to a single file problem (445)
       - fixed mtime updates during file writes (607)
       - fixed vector writes on obdfilter causing problems when ENOSPC (670)
       - fixed bug in obd_brw_read/write() (under guise of testing 367)
       - fixed Linux OST size reporting problem (444, 656)
       - OST now updates object mtime with writes or setattr (607, 619)
       - client verifies file size before zeroing page past EOF (445)
       - OST now writes last allocated objid to disk with allocation (108)
       - LOV on echo now works (409)
       * protocol changes
       - mds_reint_unlink sends a new buffer, with the EA included.  this
         buffer is only valid if body->valid & OBD_MD_FLEASIZE, which is only
         set if a regular file was being unlinked, and it was the last link
       - use PtlGet from the target for bulk writes (315)
       - OST now updates object mtime with writes or setattr (607, 619)
       - LDLM now has a grant-time callback to revalidate locked items, if
         necessary (604)
       - Many MDS operations were reorganized to combat race conditions
      * other changes
       - Merge b_intel branch (updated lprocfs code) - now at /proc/fs/lustre
       - configure check to avoid gcc version 2.96 20000731-2.96-98 (606)
      93acd158
  12. Jan 06, 2003
    • Andreas Dilger's avatar
      Merge b_md to HEAD for 0.5.19 release. · ccb42f24
      Andreas Dilger authored
      Fixes a _huge_ number of bugs:
        - Fully reactivate OST imports after reconnection (512, others)
        - Make sure client sees our -ENOTCONN from mds_handle (513 - partial)
        - More graceful error handling for truncating on dead OST (515)
        - Don't error out unless we're actually accessing dead stripes (474)
        - Fix garbage sizes when stripes are missing (410)
        - LRU counters were broken, causing constant lock purge (433, 432)
        - garbage on read from stripes with failed OSTs (441)
        - mark OSCs as active before reconnecting during recovery (438)
        - lov_enqueue and lov_cancel need to handle inactive OSTs (403)
        - lfind did not preserve OST order in output (443)
        - symlinks cause hung clients, incorrect data (439)
        - stop dereferencing request after dropping refcount (457)
        - don't LASSERT(spin_is_locked) on non-SMP (455)
        - fixes for many rename() bugs
        - fstat didn't correctly synchronize attributes (399)
        - server must handle lock cancellation during blocking AST prep (487)
        - bulk descriptors were free()d too soon (511)
        - fix paths in lconf, which would load incorrect modules (451, 507)
        - fix confusing lconf 'host not found' error message (386)
        - fix lock order deadlock on OST (O/R i_sem before journal ops, 478)
        - fix race condition in mdc_blocking_ast() for inode access (526)
        - fix lov_unpackmd() unpacking wrong number of stripes (537)
        - fix lov_set_osc_active() marking wrong OSC inactive (440)
        - fix bad lstripe lov_unpackmd() assertion (fix layering too) (527)
        - fix multiple writes of stripe MD to MDS (358, maybe 519)
        - fix lstripe in several ways (kernel side) (527)
        - fix request leak in ldlm_cli_enqueue (262)
        - incorrect OSC was marked inactive after OST failure
        - call mds_fs_cleanup before unmounting filesystem (524)
        - fix races between taking ns_lock and ldlm_lock_change_resource
        - fix races updating LOV export open file list
        - fix lov_enqueue error path, avoid decref-ing bad lock handle
        - fix recovery NULL deref in ldlm_cli_cancel_unused
        - fix some DLM races by using new hash table for lock handles (419)
        - permit the client to specify desired inodes, at replay
        - duplicate requests when we queue them for replay reintegration
        - fix last_rcvd offset calculation
        - sync after each recovered transaction, so we always make progress
        - never, not always, ERESTART requests without transnos
        - store the lov_desc in the MDS, so we don't depend on getlovinfo to set it
        - skip replay if the MDS says that the client is already connected
        - don't check for a recovery-enabled export to match lctl's UUID
        - don't INC_USE_COUNT for phantom exports
        - don't crash when cleaning up phantom exports (567)
        - don't double-finish or set replay data for errored mdc_open requests
        - abort requests when they time out, so we don't get old replies
        - send/receive replies for AST messages again
        - if the client says that it doesn't have the lock, cancel it on the server
        - if we timeout during I/O, don't try to cancel an in-use lock; instead
          mark it as destroyed, it will all work out when decref is called
        - fix module use counts (22, 581)
       * protocol changes
        - ASTs now expect a reply (server cancels lock on error reply)
      ccb42f24
  13. Dec 17, 2002
    • Phil Schwan's avatar
      · 26969941
      Phil Schwan authored
      land b_md onto HEAD.  almost entirely small cleanups and miscellaneous fixes,
      plus:
      
      - fix lock order deadlock on OST (O/R i_sem before journal ops, 478)
      - fix follow_symlink bugs (437)
      - preparation for some ldlm race fixes (419)
      26969941
  14. Dec 16, 2002
    • Mike Shaver's avatar
      Land b_recovery (other than WIP lock-replay work). · 91cda45a
      Mike Shaver authored
      - b=512,others: fully reactivate OST imports after reconnection.
      - b=513(partial): make sure client sees out -ENOTCONN from mds_handle
      - b=515: more graceful error handling for truncating on dead OST
      - b=474: don't error out just because a file is striped across a dead OST; only
               if we actually try to access it
      - b=410: fix garbage sizes when stat(2)ing a file that includes a stripe on a
               dead OST
      - print console diagnostic for completion-timeout client recovery
      - MSG_RESENT for requests that were retransmitted
      91cda45a
  15. Dec 14, 2002
    • Phil Schwan's avatar
      land b_md onto HEAD. the highlights: · cc669f5d
      Phil Schwan authored
       - fstat didn't correctly synchronize attributes (399)
       - server must handle lock cancellation during blocking AST prep (487)
       - bulk descriptors were free()d too soon (511)
       - fix paths in lconf, which would load incorrect modules (451, 507)
       - fix confusing lconf 'host not found' error message (386)
      cc669f5d
  16. Dec 06, 2002
    • Phil Schwan's avatar
      merge b_md into HEAD: · b191d0b1
      Phil Schwan authored
       - lfind did not preserve OST order in output (443)
       - symlinks cause hung clients, incorrect data (439)
       - stop dereferencing request after dropping refcount (457)
       - don't LASSERT(spin_is_locked) on non-SMP (455)
       - fixes for many rename() bugs
      b191d0b1
  17. Dec 05, 2002
    • Phil Schwan's avatar
      land b_md onto HEAD: · 4b285975
      Phil Schwan authored
       - LRU counters were broken, causing constant lock purge (433, 432)
       - the first part of the fixes for the ldlm_handle2lock race (419)
       - cleanup unused ll_inode_info field (442)
       - keep ioctl semaphore from serializing test_getattr/brw
      4b285975
  18. Dec 02, 2002
    • Andreas Dilger's avatar
      Merge of b_md to HEAD: · 73126167
      Andreas Dilger authored
      - fixes of many metadata bugs:
        - multiple client file opens
        - missing/extra intent releases
        - symlink fixes
      - OST/MDS server threads clobbering each other
        - dbench 2 works mostly
      - many others too numerous to mention here (ChangeLog is a good record)
      73126167
  19. Nov 28, 2002
    • Mike Shaver's avatar
      Landing of b_recovery (at last). · 4d477d14
      Mike Shaver authored
      Highlights:
        - b=324: MDS recovery must replay transactions in strict transno sequence
        - b=325: getattr after OST failure returns -EIO
        - b=326: unlink after OST failure returns -EIO
        - b=400: new client can't join cluster after OST failure
        - b=403: multi-client access failure when OST fails
        - b=410: After an OST failure, lfind incorrectly displays file information
        - b=417: Freeing unreplayable requests twice (aed's fix from b_md)
        - b=402: (partial) give error for lstripe request that exceeds configured OSTs
        - much better support for reconnecting to MDS after network partition
          (still some lock-repeating issues to be resolved for some requests)
        - better support for connecting to multiple MDSes on one host (xid and
          transno and request_list are all per-import now)
        - track disconnecting clients in last_rcvd, for more reliable recovery
        - also, sync last_rcvd after connect/disconnect
        - reduced syslog/CERROR output for recovery (hi, Terry!)
        - server (DLM) timeout is half the system-wide timeout, to avoid cascading
          failure in the face of a dead client
        - don't wait for recovery to finish in order to send disconnect messages
        - removal of c_dying_head
        - don't wait for timeout to trigger recovery after ptl_send_rpc error
        - strict MDS transno ordering via mds_transno_sem (non-optimal, but correct)
        - many !handle -> IS_ERR(handle) fixes around mds_fs_start callers.
        - turn on client-eviction for bulk-timeouts in OST and MDS
      4d477d14
  20. Nov 25, 2002
    • Phil Schwan's avatar
      merge b_md onto HEAD. as best as I can remember: · 1f5c8cf8
      Phil Schwan authored
      - fix dentry->d_it clobbering
      - credentials weren't being shipped for readdir/getattr operations
      - remove invalid assertions triggered during some concurrent MD updates
      - many, many updates to llanalyse
      1f5c8cf8
  21. Nov 23, 2002
  22. Nov 21, 2002
  23. Nov 18, 2002
    • Phil Schwan's avatar
      b=191 · 67e9e13a
      Phil Schwan authored
      b=214
      b=306
      b=385
      landing b_brw, which _includes_ all fixes from b_lov.
      
      Roughly, this fixes bugs in 3 general areas:
      
      - OSC was locking one byte too many; fixed.
      
      - several LOV bug fixes, to support adding LOVs and striping over fewer
        than the maximum number
      
      - the brw code was rewritten
      
      See the bugs listed above for more details.  This will be immediately
      tagged as 0.5.17.
      67e9e13a
  24. Nov 13, 2002
  25. Nov 10, 2002
  26. Nov 08, 2002
  27. Nov 07, 2002
  28. Nov 05, 2002
  29. Nov 02, 2002
Loading