Skip to content
Snippets Groups Projects
  1. Nov 24, 2008
    • Elena Gryaznova's avatar
      b=17747 · f8518f8d
      Elena Gryaznova authored
      i=Tappro
      FAIL_ON_ERROR=false fix: force suits exit 1 if some tests failed
      f8518f8d
  2. Nov 23, 2008
    • Yury Umanets's avatar
      b=17631 · 05245c55
      Yury Umanets authored
      r=panda,shadow
      
      - fixes possible sync long bulk unlink in ptlrpcd which would lead to assertion in forced umount time. Basically the fix is identical to 17310 where we move req to special phase UNREGISTERING and go processing other rpcs until bulk unlink is done;
      
      - in sync bulk and reply unlink we check for wakeup condition every 1 sec to act quckly if unlink come instead of doing it every 20 sec as before.
      05245c55
    • Yury Umanets's avatar
      - commit missed bit from previous commit. · b7a74ce8
      Yury Umanets authored
      b7a74ce8
    • Yury Umanets's avatar
      b=17690 · d3f2af57
      Yury Umanets authored
      r=shadow
      
      - fixes in replay-single.sh test_59b
      d3f2af57
    • Yury Umanets's avatar
      b=17751 · e76ef895
      Yury Umanets authored
      r=grev
      
      - fixes and cleanups in test_124a from sanity.sh
      e76ef895
  3. Nov 21, 2008
    • Elena Gryaznova's avatar
      b=17735 · 0db7ffd4
      Elena Gryaznova authored
      i=Yury.Umanets
      check_mem_leak fn fix: use echo instead of log fn
      0db7ffd4
  4. Nov 20, 2008
    • Vitaly Fertman's avatar
      Branch b1_6 · 8b8991b7
      Vitaly Fertman authored
      b=16129
      i=adilger
      i=green
      
      - a high priority request list is added into service;
      - once a lock is canceled, all the IO requests, including coming
      ones, under this lock, are moved into this list;
      - PING is also added into this list;
      - once a lock cancel timeout occurs, the timeout is prolonged
      if there is an IO rpc under this lock;
      - another request list is added into the export, used to speed up
      the rpc-lock matching.
      8b8991b7
    • Andrew Perepechko's avatar
      Branch b1_6 · 9db41719
      Andrew Perepechko authored
      b=17371
      i=Johann Lombardi
      
      testcase for 17371
      9db41719
  5. Nov 19, 2008
  6. Nov 14, 2008
  7. Nov 13, 2008
    • Oleg Drokin's avatar
      b=16823 · 0b41127b
      Oleg Drokin authored
      r=shadow,adilger
      
      Lift 4G limit on stripe_size*stripe_count
      4G limit on stripe_size remains in place, though.
      0b41127b
  8. Nov 12, 2008
    • Elena Gryaznova's avatar
      b=17634 · 8a0375e6
      Elena Gryaznova authored
      i=Yury.Umanets
      insanity cleanup (remove dup fn, sync with HEAD t-f)
      8a0375e6
    • Elena Gryaznova's avatar
      b=16488 · c448adfa
      Elena Gryaznova authored
      i=Oleg.Drokin
      new runracer script
      c448adfa
    • Elena Gryaznova's avatar
      b=17555 · 3a5105ef
      Elena Gryaznova authored
      i=Adilger
      use current config instead of reformat fs to have a single ost
      3a5105ef
  9. Nov 11, 2008
    • Elena Gryaznova's avatar
      b=16551 · ee78b870
      Elena Gryaznova authored
      i=Adilger
      conf-sanity test_32* fix to not be skipped for remote setup
      ee78b870
  10. Nov 07, 2008
    • Elena Gryaznova's avatar
      b=17477 · 45671edb
      Elena Gryaznova authored
      i=Adilger
      replace cleanup_and_setup_lustre fn by check_and_setup_lustre fn
      45671edb
    • Elena Gryaznova's avatar
      b=17477 · f04bb1d7
      Elena Gryaznova authored
      i=Adilger
      check config if lustre is mounted before acc-sm run
      f04bb1d7
    • Elena Gryaznova's avatar
      b=14384 · ac21c0e8
      Elena Gryaznova authored
      i=Brian
      assert_DIR cleanup
      ac21c0e8
    • Bobi Jam's avatar
      Branch b1_6 · 7770cb12
      Bobi Jam authored
      b=16578
      o=adilger
      
      A faster way to get long string.
      7770cb12
  11. Nov 06, 2008
    • Yury Umanets's avatar
      b=17310 · 8c981415
      Yury Umanets authored
      r=johann,shadow
      
      - fixes ptlrpcd blocking on very long reply unlink waiting. To do so new rpc phase introduced
      RQ_PHASE_UNREGISTERING in which request stay until we have reply_in_callback() called by lnet
      signaling that reply is unlinked. All requests in this state are skipped in processing by prlrcd
      instead of waiting n * 300s on each of them. This allows ptlrpcd to process other rpcs in the set;
      
      - make sure that inflight count is coherent with being present on sending or delay list. That is,
      if we see inflight != 0, rpc must be on one of these lists. This is very helpful in
      ptlrpc_invalidate_import() to show all rpcs still waiting after invalidating import;
      
      - in ptlrpc_invalidate_import() wait maximal rq_deadline - now from all inflight rpcs instead of
      obd_timeout which may be much longer. If calculated timeout is 0, obd_timeout is used. This fixes
      the issue that rq_deadline - now > obd_timeout (very easy to see in logs) which led to inflight !=
      0 assert because inflight rpcs timed out later than our wait period is finished;
      
      - in ptlrpc_invalidate_import() wait forever for rpcs in UNREGISTERING phase. Check in assert for
      inflight == 0 for wait timed out case if no rpcs in UNREGISTERING phase. Only those in
      UNREGISTERING phase are allowed to stay longer than obd_timeout;
      
      - added ptlrpc_move_rqphase() function. All phase changes go through it. Add debug_req() there to
      track down all phase changes;
      
      - conf_sanity.sh test_45 added to emulate very long reply unlink and also situation when
      rq_deadline - now > obd_timeout;
      
      - do not wait forever in ptlrpc_unregister_reply() for async case (using it from sets). sync case
      left unchanged;
      
      - make sure that ptlrpc_set_next_timeout() yields 1s timeout (instead of 0s) for the set with rpcs
      in "unregistering" stage to prevent ptlrpcd from sleeping forever and hanging in test_45;
      
      - in ptlrpcd() make sure that we do not sleep on 0 timeout.
      8c981415
  12. Nov 05, 2008
    • Elena Gryaznova's avatar
      b=16551 · 5bf79c45
      Elena Gryaznova authored
      i\Adilger
      correct remote_[mds|ost] fn to work correctly on configuration
      with several MDS/OSS nodes
      5bf79c45
    • Jinshan Xiong's avatar
      · ee927013
      Jinshan Xiong authored
      b=15715
      r=adilger,green
      
      Fixed the race of destroying and enqueuing a ldlm lock at OST side.
      ee927013
    • Bobi Jam's avatar
      Branch b1_6 · e9d306ad
      Bobi Jam authored
      b=16578
      i=adilger
      
      Description: ldlm_cancel_pack()) ASSERTION(max >= dlm->lock_count + count)
      Details    : If there is no extra space in the request for early cancels,
                   ldlm_req_handles_avail() returns 0 instead of a negative value.
      e9d306ad
  13. Nov 03, 2008
  14. Oct 31, 2008
  15. Oct 28, 2008
    • Bobi Jam's avatar
      Branch b1_6 · 3edda35c
      Bobi Jam authored
      b=17093
      o=Btian Behlendorf (behlendorf1@llnl.gov) for mgs part
      o=E. Gryaznova (grev) for test framework
      i=nathan.rutman
      i=adilger
      
      Do writeconf only explicitly required.
      3edda35c
  16. Oct 27, 2008
    • Hongchao Zhang's avatar
      Branch b1_6 · bd78a8a4
      Hongchao Zhang authored
      handle the problem of test_120a in sanity.sh: 1 cancel RPC occured
      
      b=14502
      i=adilger
      i=grev
      bd78a8a4
  17. Oct 24, 2008
    • Elena Gryaznova's avatar
      b=17477 · 45c80df5
      Elena Gryaznova authored
      i=Adilger
      i=Tappro
      init facets vars for mounted lustre
      45c80df5
    • Elena Gryaznova's avatar
      b=16551 · 2327e349
      Elena Gryaznova authored
      i=Nathan
      skip replay/recovery tests if remote MDS/OSS with nodsh
      lost_test55 fix
      2327e349
  18. Oct 23, 2008
    • Elena Gryaznova's avatar
      b=16551 · 29b5efda
      Elena Gryaznova authored
      i=Nathan
      
      skip replay/recovery tests if remote MDS/OSS with nodsh
      lost test_27 fix
      29b5efda
    • Yury Umanets's avatar
      b=17323 · 5d9e2961
      Yury Umanets authored
      r=tappro
      
      - small fix from Mike
      5d9e2961
  19. Oct 22, 2008
    • Elena Gryaznova's avatar
      b=15711 · c76ea477
      Elena Gryaznova authored
      i=Johann
      recovery-small exception for FAILURE_MODE=HARD and mixed ost devices
      c76ea477
Loading