- Nov 19, 2008
-
-
Alexey Lyashkov authored
Branch b1_6 b=16492 i=umka i=tappro
-
- Nov 17, 2008
-
-
tianzy authored
decay qos ost/oss penalties if MDS is not creating objects i=nathan i=johann
-
- Nov 13, 2008
-
-
Yury Umanets authored
r=adilger,behlendorf1 - avoid div/mod in lustre_hash code
-
Oleg Drokin authored
r=shadow,adilger Lift 4G limit on stripe_size*stripe_count. The 4G limit on stripe_size remains in place, though.
-
- Nov 12, 2008
-
-
Yury Umanets authored
r=tappro,wangdi - fixes ost cleanup issue due to missed llcd_put() in the case where ost does not receive disconnect from mds; - do not sleep on hanging llcd. Instead assert on it _after_ stopping recov_thread's ptlrpcd, which should kill any remaining llcds; - fixes and cleanups, comments.
-
- Nov 07, 2008
-
-
Yury Umanets authored
r=adilger,johann - removes deadlock possibility by disabling rehash in hash_del() operations and moving hash_add() calls out of spin_locks. The hash table has its own mechanisms for protecting its structures, and it also has a hash_add_unique() method for use in concurrent contexts; - fixed missed lh_put() in hash_add_unique() which led to extra refs in some cases (extra ref to export) and inability to clean up; - fixed __lustre_hash_set_theta() which set @max theta into ->lh_min_theta; - in lustre_hash_rehash_size() disable rehash also for the case when new and old hash sizes are equal in corner cases (max_size or min_size). Before this fix, needless rehashes were possible: the expensive operation was performed even though the size did not actually change; - disable rehash in hash_add_unique() if no actual add happened because an entry with the same key is already in the table; - some cleanups in hash table code;
-
Johann Lombardi authored
b=16860 i=nathan i=rread Description: Excessive recovery window. Details: With AT enabled, the recovery window can be excessively long (6000+ seconds). To address this problem, we no longer use OBD_RECOVERY_FACTOR when extending the recovery window (the connect timeout no longer depends on the service time, it is set to INITIAL_CONNECT_TIMEOUT now) and clients report the old service time via pb_service_time.
-
- Nov 06, 2008
-
-
Yury Umanets authored
r=johann,shadow - fixes ptlrpcd blocking on very long reply unlink waiting. To do so, a new rpc phase, RQ_PHASE_UNREGISTERING, is introduced, in which a request stays until reply_in_callback() is called by lnet, signaling that the reply is unlinked. All requests in this state are skipped in processing by ptlrpcd instead of waiting n * 300s on each of them. This allows ptlrpcd to process other rpcs in the set; - make sure that inflight count is coherent with being present on sending or delay list. That is, if we see inflight != 0, rpc must be on one of these lists. This is very helpful in ptlrpc_invalidate_import() to show all rpcs still waiting after invalidating import; - in ptlrpc_invalidate_import() wait for the maximal rq_deadline - now over all inflight rpcs instead of obd_timeout which may be much longer. If the calculated timeout is 0, obd_timeout is used. This fixes the issue that rq_deadline - now > obd_timeout (very easy to see in logs), which led to the inflight != 0 assert because inflight rpcs timed out after our wait period had finished; - in ptlrpc_invalidate_import() wait forever for rpcs in UNREGISTERING phase. If the wait timed out and no rpcs are in UNREGISTERING phase, assert that inflight == 0. Only those in UNREGISTERING phase are allowed to stay longer than obd_timeout; - added ptlrpc_move_rqphase() function. All phase changes go through it. Add debug_req() there to track down all phase changes; - conf_sanity.sh test_45 added to emulate very long reply unlink and also the situation when rq_deadline - now > obd_timeout; - do not wait forever in ptlrpc_unregister_reply() for async case (using it from sets). sync case left unchanged; - make sure that ptlrpc_set_next_timeout() yields 1s timeout (instead of 0s) for the set with rpcs in "unregistering" stage to prevent ptlrpcd from sleeping forever and hanging in test_45; - in ptlrpcd() make sure that we do not sleep on 0 timeout.
-
- Nov 05, 2008
-
-
Jinshan Xiong authored
b=15715 r=adilger,green Fixed the race between destroying and enqueuing an ldlm lock on the OST side.
-
- Nov 04, 2008
-
-
Yury Umanets authored
-
- Nov 03, 2008
-
-
Andrew Perepechko authored
b=17493 i=Andreas Dilger i=Johann Lombardi handling of a broken readonly key
-
- Oct 24, 2008
-
-
Alexey Lyashkov authored
without protection. Branch b1_6 b=16693 i=umka i=tappro
-
- Oct 21, 2008
-
-
Robert Read authored
b=1819 i=adilger i=nathan Add an import file to the osc, mdc, and mgc proc dir, and include test for new proc file.
-
Yury Umanets authored
r=adilger,johann - handle log_cancel resent correctly; - some cleanups in llog.
-
- Oct 17, 2008
-
-
Andreas Dilger authored
Add in OBD_CONNECT flags from b1_8 so that they are not mistakenly used for something else.
-
- Oct 16, 2008
-
-
huanghua authored
b=17403 i=adilger i=yury.umanets create objects in correct directory on OST.
-
- Oct 15, 2008
-
-
Andrew Perepechko authored
b=17302 i=Johann Lombardi i=ZhiYong Tian pass QFMT through qc_id to be compatible with older Lustre versions
-
Andreas Dilger authored
Backport warning fixes from b1_8. i=girish (original patch) i=robert
-
- Oct 13, 2008
-
-
alex authored
i=adilger i=green i=nikita - oss read-only cache feature
-
- Oct 10, 2008
-
-
Nathan Rutman authored
i=adilger i=nathan rename MAXPOOLNAME to LOV_MAXPOOLNAME
-
Bobi Jam authored
b=17309 i=johann backport from bz 16266 for proper use of the write lock in lprocfs_remove().
-
- Oct 09, 2008
-
-
Yury Umanets authored
r=adilger,tappro - truncate last_rcvd for abort recovery case.
-
- Oct 08, 2008
- Oct 07, 2008
-
-
mattwu authored
i=nikita i=robert winnt libcfs/lnet cleanup
-
- Oct 06, 2008
-
-
Nathan Rutman authored
a=19432 i=manoj.joseph i=vladimir.saveliev add conf_param settings to mdd b=17180 a=19402 i=nathan i=johann rename pools tests to a common base b=17234 a=19563 i=yury.umanets i=nathan set pools for all MDTs for CMD
-
- Oct 01, 2008
-
-
Alexey Lyashkov authored
Branch HEAD b=17157 i=johann i=wangdi
-
- Sep 30, 2008
-
-
Yury Umanets authored
r=adilger,tappro - if aborting recovery, don't force sync IO to last_rcvd file
-
Alexey Lyashkov authored
parallel llog init Branch HEAD b=16679 i=umka
-
Andrew Perepechko authored
Branch b1_6 b=17059 i=ZhiYong Tian i=Johann Lombardi Always build client with quota support
-
- Sep 29, 2008
-
-
Alexey Lyashkov authored
Branch b1_6 b=17157 i=johann i=wangdi
-
- Sep 27, 2008
-
-
Yury Umanets authored
r=adilger,shadow - various CMD related fixes.
-
- Sep 25, 2008
-
-
Yang Sheng authored
b=16771 i=adilger, johann Use a fixed-size timestamp instead of time_t in the cfg_marker and keep compatibility with old cfg_marker on 32-bit systems.
-
Yang Sheng authored
b=16771 i=adilger, johann Use a fixed-size timestamp instead of time_t in the cfg_marker and keep compatibility with old cfg_marker on 32-bit systems.
-
- Sep 19, 2008
-
-
Yury Umanets authored
r=shadow - final part of 16776, already landed on 1.6.x after shadow's inspection.
-
Yury Umanets authored
r=shadow,vitaly,adilger,robert - new class_hash.c. Using new hashtable for connections, quota, held locks on mds, etc.
-
- Sep 18, 2008
-
-
Eric Mei authored
- fix race of handling early reply in sptlrpc. - port AT api changes from b1_8 (b16972) b=16999 r=rread r=wangdi
-
- Sep 16, 2008
-
-
Bobi Jam authored
b=16788 i=adilger i=nathan.rutman add per-nid statistics on HEAD.
-
- Sep 15, 2008
-
-
Nathan Rutman authored
i=nathan i=adilger OST pools on HEAD, comprehensive patch including 17054:19007; 16935:18918,19012,19089,19128; 16978:18872
-
Vitaly Fertman authored
b=15949 i=adilger i=shadow mds_body.capability fix
-