Skip to content
Snippets Groups Projects
  1. Nov 06, 2008
    • Yury Umanets's avatar
      b=17310 · 8c981415
      Yury Umanets authored
      r=johann,shadow
      
      - fixes ptlrpcd blocking on very long reply unlink waiting. To do so new rpc phase introduced
      RQ_PHASE_UNREGISTERING in which request stay until we have reply_in_callback() called by lnet
      signaling that reply is unlinked. All requests in this state are skipped in processing by prlrcd
      instead of waiting n * 300s on each of them. This allows ptlrpcd to process other rpcs in the set;
      
      - make sure that inflight count is coherent with being present on sending or delay list. That is,
      if we see inflight != 0, rpc must be on one of these lists. This is very helpful in
      ptlrpc_invalidate_import() to show all rpcs still waiting after invalidating import;
      
      - in ptlrpc_invalidate_import() wait maximal rq_deadline - now from all inflight rpcs instead of
      obd_timeout which may be much longer. If calculated timeout is 0, obd_timeout is used. This fixes
      the issue that rq_deadline - now > obd_timeout (very easy to see in logs) which led to inflight !=
      0 assert because inflight rpcs timed out later than our wait period is finished;
      
      - in ptlrpc_invalidate_import() wait forever for rpcs in UNREGISTERING phase. Check in assert for
      inflight == 0 for wait timed out case if no rpcs in UNREGISTERING phase. Only those in
      UNREGISTERING phase are allowed to stay longer than obd_timeout;
      
      - added ptlrpc_move_rqphase() function. All phase changes go through it. Add debug_req() there to
      track down all phase changes;
      
      - conf_sanity.sh test_45 added to emulate very long reply unlink and also situation when
      rq_deadline - now > obd_timeout;
      
      - do not wait forever in ptlrpc_unregister_reply() for async case (using it from sets). sync case
      left unchanged;
      
      - make sure that ptlrpc_set_next_timeout() yields 1s timeout (instead of 0s) for the set with rpcs
      in "unregistering" stage to prevent ptlrpcd from sleeping forever and hanging in test_45;
      
      - in ptlrpcd() make sure that we do not sleep on 0 timeout.
      8c981415
  2. Nov 05, 2008
    • Andrew Perepechko's avatar
      Branch b1_6 · a5fa4e1d
      Andrew Perepechko authored
      b=17371
      i=Johann Lombardi
      i=Oleg Drokin
      
      fix a race between requeue thread processing and umount
      a5fa4e1d
    • Elena Gryaznova's avatar
      b=16551 · 5bf79c45
      Elena Gryaznova authored
      i\Adilger
      correct remote_[mds|ost] fn to work correctly on configuration
      with several MDS/OSS nodes
      5bf79c45
    • kalpak's avatar
      · b1907268
      kalpak authored
      b=16438
      i=adilger
      i=girish
      
      Mounting a filesystem with extents feature will fail on big-endian systems since ext3-based ldiskfs is not supported on big-endian systems. This can be over-riden with "bigendian_extents" mount option.
      b1907268
    • Jinshan Xiong's avatar
      · ee927013
      Jinshan Xiong authored
      b=15715
      r=adilger,green
      
      Fixed the race of destroying and enqueuing a ldlm lock at OST side.
      ee927013
    • Bobi Jam's avatar
      Branch b1_6 · e9d306ad
      Bobi Jam authored
      b=16578
      i=adilger
      
      Description: ldlm_cancel_pack()) ASSERTION(max >= dlm->lock_count + count)
      Details    : If there is no extra space in the request for early cancels,
                   ldlm_req_handles_avail() returns 0 instead of a negative value.
      e9d306ad
    • Liu Ying's avatar
      *** empty log message *** · f391b34e
      Liu Ying authored
      f391b34e
  3. Nov 04, 2008
  4. Nov 03, 2008
    • Mikhail Pershin's avatar
      - test fix from 12512 · 1a2954ab
      Mikhail Pershin authored
        b:12512
        i:grev, adilger
      1a2954ab
    • Andrew Perepechko's avatar
      · cfa82133
      Andrew Perepechko authored
      b=17493
      i=Andreas Dilger
      i=Johann Lombardi
      handling of a broken readonly key
      cfa82133
    • tianzy's avatar
      Branch b1_6 · ff76c1dc
      tianzy authored
      fix an error in the test_18 of sanity-quota.sh
      b=17523
      i=johann
      i=panda
      ff76c1dc
    • Andreas Dilger's avatar
      Branch b1_6 · 542edff1
      Andreas Dilger authored
      Quiet compiler warning about unused label.
      Conditional check will be optimized away by compiler.
      542edff1
    • Andreas Dilger's avatar
      Branch b1_6 · 99eee3f6
      Andreas Dilger authored
      Fix 80-column line wrapping.
      99eee3f6
  5. Oct 31, 2008
  6. Oct 30, 2008
    • girish's avatar
      Remove the LBUG and instead, return an error if npages > OST_THREAD_POOL_SIZE · f48750a9
      girish authored
      i=johann
      i=adilger
      b=17448
      f48750a9
    • Bobi Jam's avatar
      · 78186c94
      Bobi Jam authored
      Branch b1_6
      b=16887
      i=pravin.shelar
      i=adilger
      
      Address LBUG, ASSERTION(client_stat->nid_exp_ref_count == 0) failed:count -1
      * add client stat on obd_nid_stat after client stat is ready.
      * properly decrease exp_nid_stats' nid_exp_ref_count in lprocfs_exp_cleanup().
      78186c94
  7. Oct 29, 2008
    • Wang Di's avatar
      Branch: b1_6 · aaf77778
      Wang Di authored
      remove unecessary return.
      aaf77778
    • Wang Di's avatar
      Branch: b1_6 · 34674a36
      Wang Di authored
      Once the unmatched stride IO mode is detected, shrink the stride-ahead window to 0.
      if it does hit cache miss, and read-pattern is still stride-io mode,
      does not reset the stride window, but also does not increase the stride
      window length in this case.
      b=17197
      i=Nikita
      i=Andreas
      34674a36
    • Nathan Rutman's avatar
      b=15899 · 3e04fca3
      Nathan Rutman authored
      i=johann
      i=adilger
      coverity fix.  thought I landed this awhile ago, but apparently not...
      3e04fca3
  8. Oct 28, 2008
    • Robert Read's avatar
      Branch b1_6 · 448f828a
      Robert Read authored
      b=17491
      i=nathan
      i=rread
      
      Quick fix patch from behlendorf1@llnl.gov.
      448f828a
    • kalpak's avatar
      · f1e739e5
      kalpak authored
      b=16680
      i=adilger, kalpak (o=bzzz)
      
      Detect on-disk corruption of block bitmap and better checking of preallocated blocks.
      f1e739e5
    • Johann Lombardi's avatar
      Branch b1_6 · 960c808e
      Johann Lombardi authored
      b=17089
      i=wangdi
      
      fix mistake made when the patch was landed.
      960c808e
    • Bobi Jam's avatar
      Branch b1_6 · 3edda35c
      Bobi Jam authored
      b=17093
      o=Btian Behlendorf (behlendorf1@llnl.gov) for mgs part
      o=E. Gryaznova (grev) for test framework
      i=nathan.rutman
      i=adilger
      
      Do writeconf only explicitly required.
      3edda35c
  9. Oct 27, 2008
    • Johann Lombardi's avatar
      Branch b1_6 · bc4e78a6
      Johann Lombardi authored
      b=17385
      i=green
      i=shadow
      
      grab lock reference when the lock is added to the waiting or expired list.
      bc4e78a6
    • Hongchao Zhang's avatar
      Branch b1_6 · bd78a8a4
      Hongchao Zhang authored
      handle the problem of test_120a in sanity.sh: 1 cancel RPC occured
      
      b=14502
      i=adilger
      i=grev
      bd78a8a4
    • Hongchao Zhang's avatar
      Branch b1_6 · 0fdbc45f
      Hongchao Zhang authored
      handle "Unexpected: can't find mdc_open_data,
      but the close succeed, Please tell <http://bugzilla.lustre.org/>."
      printed in mdc_close in mdc_request.c
      b=17089
      i=johann
      0fdbc45f
  10. Oct 24, 2008
Loading