Clint Armstrong
93e4656a27
Use a float to represent GPU utilization
...
Before this change, workers could only be allocated one GPU task,
regardless of how much of the GPU resources that task used or how many
GPUs were in the system.
This makes GPUUtilization a float, which can represent that a task needs
a portion of a GPU or multiple GPUs. GPUs are accounted for like RAM and
CPUs so that workers with more GPUs can be allocated more tasks.
A known issue is that PC2 cannot use multiple GPUs. Even if the
worker has multiple GPUs and is allocated multiple PC2 tasks, those
tasks will only run on the first GPU.
This could result in unexpected behavior when a worker with multiple
GPUs is assigned multiple PC2 tasks. This should not surprise
existing users who upgrade, as anyone running workers with
multiple GPUs should already know this and be running a worker per GPU
for PC2. Those users now have the freedom to set the GPU
utilization of PC2 to less than one and effectively run multiple PC2
processes in a single worker.
C2 is capable of utilizing multiple GPUs, and now workers can be
customized for C2 accordingly.
2021-11-30 02:06:58 +01:00
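The accounting described in the commit above can be illustrated with a minimal Go sketch. It assumes a simplified worker that tracks a running sum of GPU utilization; the `gpuWorker` type and its `canHandle` and `assign` methods are hypothetical names chosen for illustration and are not the scheduler code touched by the commit.

```go
package main

import "fmt"

// gpuWorker is a hypothetical, simplified stand-in for a worker's resource
// bookkeeping. It is not the actual scheduler type from the repository.
type gpuWorker struct {
	gpus    int     // number of GPUs installed in the worker
	gpuUsed float64 // sum of the GPU utilization of tasks currently assigned
}

// canHandle reports whether a task that needs gpuUtilization GPUs
// (a fraction such as 0.5, or a multiple such as 2.0) still fits.
func (w *gpuWorker) canHandle(gpuUtilization float64) bool {
	if gpuUtilization == 0 {
		return true // the task does not use a GPU at all
	}
	return w.gpuUsed+gpuUtilization <= float64(w.gpus)
}

// assign records the task's GPU share if it fits and reports success.
func (w *gpuWorker) assign(gpuUtilization float64) bool {
	if !w.canHandle(gpuUtilization) {
		return false
	}
	w.gpuUsed += gpuUtilization
	return true
}

func main() {
	w := &gpuWorker{gpus: 2}
	fmt.Println(w.assign(0.5)) // half a GPU
	fmt.Println(w.assign(0.5)) // another half
	fmt.Println(w.assign(1.0)) // one whole GPU; the worker is now full
	fmt.Println(w.assign(0.5)) // rejected, no GPU capacity left
}
```

In this sketch GPU capacity is treated as a divisible quantity, in the same way the commit describes RAM and CPUs being accounted for. It does not model which physical GPU a task runs on, so it does not capture the PC2 limitation noted in the commit message.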
Clint Armstrong
c4f46171ae
Report memory used and swap used in worker res
...
Attempting to report "memory used by other processes" in the MemReserved
field fails to take into account the fact that the system's memory used
includes memory used by ongoing tasks.
To properly account for this, the worker should report the memory and
swap used; the scheduler, which is aware of the memory requirements for
a task, can then determine whether sufficient memory is available for it.
2021-11-30 02:06:58 +01:00
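One way a scheduler could use the reported figures is sketched below. This is an assumption-laden illustration, not the logic from the commit: the `workerMemory` struct, its field names, and `hasMemoryFor` are hypothetical, and taking the larger of the reported usage and the scheduler's own tracked usage is just one possible policy for avoiding double counting of memory held by ongoing tasks.

```go
package main

import "fmt"

// workerMemory holds the figures a worker might report. All names here are
// hypothetical and chosen for illustration only.
type workerMemory struct {
	Physical uint64 // total physical memory, in bytes
	Swap     uint64 // total swap, in bytes
	Used     uint64 // physical memory currently in use on the system
	SwapUsed uint64 // swap currently in use on the system
}

// hasMemoryFor decides whether a task needing `need` bytes fits.
// `tracked` is the memory the scheduler already attributes to running tasks.
// Reported usage already includes those tasks, so the two figures are not
// added together; the larger of the two is taken as the amount in use.
func hasMemoryFor(m workerMemory, tracked, need uint64) bool {
	total := m.Physical + m.Swap
	inUse := m.Used + m.SwapUsed
	if tracked > inUse {
		inUse = tracked
	}
	if inUse > total {
		return false // guard against unsigned underflow below
	}
	return need <= total-inUse
}

func main() {
	const gib = uint64(1) << 30
	m := workerMemory{Physical: 64 * gib, Swap: 16 * gib, Used: 40 * gib}
	fmt.Println(hasMemoryFor(m, 32*gib, 32*gib))
	fmt.Println(hasMemoryFor(m, 32*gib, 48*gib))
}
```

With 80 GiB of physical memory plus swap and 40 GiB in use, the first request (32 GiB) fits and the second (48 GiB) does not.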
Jiaying Wang
4d2f3375e8
Merge pull request #7699 from filecoin-project/feat/listcids-verbose
...
Add verbose mode to lotus-miner pieces list-cids
2021-11-29 17:43:09 -05:00
Łukasz Magiera
f8b132890c
Add verbose mode to lotus-miner pieces list-cids
2021-11-29 17:19:47 +01:00
Jakub Sztandera
4d8be81a8f
Add usage
...
Signed-off-by: Jakub Sztandera <kubuxu@protocol.ai>
2021-11-26 15:01:05 +01:00
Jakub Sztandera
e3c7b8d006
Fix typo
...
Co-authored-by: Aayush Rajasekaran <arajasek94@gmail.com>
2021-11-26 14:42:18 +01:00
Jakub Sztandera
af113f867d
Add dump code
...
Signed-off-by: Jakub Sztandera <kubuxu@protocol.ai>
2021-11-25 20:53:49 +01:00
Łukasz Magiera
b522901845
update storage attach allowTo usage
2021-11-23 16:23:06 +01:00
Łukasz Magiera
e362d47903
Add optional group flags to storage attach commands
2021-11-23 16:11:04 +01:00
Łukasz Magiera
b1781c33ce
Show group info in storage list
2021-11-23 16:11:04 +01:00
Łukasz Magiera
51753a1e25
Merge pull request #7414 from filecoin-project/feat/shed-wallet-balancer
...
shed: simple wallet balancer util
2021-11-22 18:11:42 +01:00
Łukasz Magiera
0262f1cd6e
shed: simple wallet balancer util
2021-11-22 14:29:23 +01:00
Łukasz Magiera
81a2f2f06d
Merge pull request #7329 from filecoin-project/feat/lotus-stats-update
...
Add caches to lotus-stats and splitcode
2021-11-22 12:44:00 +01:00
Łukasz Magiera
9adaa9b967
lotus-shed msg: Decode submessages/msig proposals
2021-11-17 23:10:14 +01:00
Aayush Rajasekaran
495f08473d
Shed: Add a util to list miner faults
2021-11-07 23:33:37 +00:00
Aayush Rajasekaran
3be0c068b9
Shed: Add a util to create miners more easily
2021-11-03 00:20:48 +00:00
Travis Person
2d4f5958e2
Add caches to lotus-stats and splitcode
2021-11-01 09:05:14 +00:00
Jennifer Wang
dc6bd2b015
use existing util
2021-10-26 20:53:16 -04:00
Jennifer Wang
18834de7d0
parse attofil to fil
2021-10-26 19:37:29 -04:00
Aayush Rajasekaran
e1deed3b1a
Merge pull request #7544 from filecoin-project/asr/shed
...
Shed: Add a util to find miner based on peerid
2021-10-20 17:55:16 -07:00
Łukasz Magiera
f7884c4f45
Merge pull request #7527 from filecoin-project/feat/show-prepared-waitres
...
Show prepared tasks in sealing jobs
2021-10-20 15:22:35 +01:00
Aayush Rajasekaran
9ba7f372cd
Shed: Add a util to find miner based on peerid
2021-10-19 18:46:09 -07:00
Łukasz Magiera
ba171954b4
Merge pull request #7479 from filecoin-project/feat/prov-check-by-storage
...
Add storage-id flag to proving check
2021-10-19 10:06:30 +01:00
Łukasz Magiera
261238e157
Show prepared tasks in sealing jobs
2021-10-18 18:44:56 +02:00
Jennifer Wang
d1e15e61d0
Merge branch 'release/v1.12.0' into jen/12-master
2021-10-11 14:37:32 -04:00
Jennifer Wang
16743a25c2
print confidence in decimal
2021-10-11 12:34:51 -04:00
dirkmc
f9433811c7
Merge pull request #7454 from filecoin-project/nonsense/retry-in-publish-deal
...
itests: retry deal when control addr is out of funds
2021-10-11 15:14:27 +02:00
Jakub Sztandera
61e1297cc7
gofmt
...
Signed-off-by: Jakub Sztandera <kubuxu@protocol.ai>
2021-10-11 08:24:55 -04:00
Jakub Sztandera
e40c7b7a86
Fix withdrawn amount equiality check
...
Co-authored-by: Jiaying Wang <42981373+jennijuju@users.noreply.github.com>
2021-10-11 08:24:33 -04:00
Jakub Sztandera
177033d0f2
Fix withdrawn amount equiality check
...
Co-authored-by: Jiaying Wang <42981373+jennijuju@users.noreply.github.com>
2021-10-11 08:24:24 -04:00
Aayush Rajasekaran
cab724841a
Update cmd/lotus-miner/actor.go
...
Co-authored-by: Jiaying Wang <42981373+jennijuju@users.noreply.github.com>
2021-10-11 08:24:00 -04:00
Aayush Rajasekaran
2241f02621
Bugfix: actor withdraw CLI should depend on network version
2021-10-11 08:23:53 -04:00
Jakub Sztandera
5abba9ca9f
gofmt
...
Signed-off-by: Jakub Sztandera <kubuxu@protocol.ai>
2021-10-11 14:10:06 +02:00
Anton Evangelatov
1acb6b4328
update CLI command to take arg; reduce time for itest
2021-10-11 14:03:56 +02:00
Anton Evangelatov
6e5ccc87cf
cli: add retry for deals stuck in Publish with no funds
2021-10-11 14:03:55 +02:00
Jakub Sztandera
8533ae0032
Fix withdrawn amount equiality check
...
Co-authored-by: Jiaying Wang <42981373+jennijuju@users.noreply.github.com>
2021-10-11 14:03:28 +02:00
Jakub Sztandera
90ff1d3ccf
Fix withdrawn amount equiality check
...
Co-authored-by: Jiaying Wang <42981373+jennijuju@users.noreply.github.com>
2021-10-11 14:03:21 +02:00
Aayush Rajasekaran
be774f55ce
Update cmd/lotus-miner/actor.go
...
Co-authored-by: Jiaying Wang <42981373+jennijuju@users.noreply.github.com>
2021-10-10 10:40:22 -04:00
Aayush Rajasekaran
e158b1d1e4
Bugfix: actor withdraw CLI should depend on network version
2021-10-09 20:17:03 -04:00
Łukasz Magiera
bfcfb199a0
Add storage-id flag to proving check
2021-10-08 14:56:08 +02:00
Peter Rabbitson
8b01725129
Dep upgrade pass
...
Touching just the seemingly non-controversial ones
2021-10-08 12:37:40 +02:00
Aayush Rajasekaran
89db4235ce
Shed: Add a util to find the most recent null tipset
2021-10-06 19:37:35 -04:00
Jennifer Wang
38e01cba83
Merge branch 'release/v1.12.0' into jen/12tomaster
2021-10-04 02:38:00 -04:00
Aayush Rajasekaran
a335cb3767
Update to latest actors
2021-10-01 17:11:05 -04:00
Łukasz Magiera
a8a9818043
Expose storage states on the metrics endpoint
2021-10-01 14:45:01 +02:00
Łukasz Magiera
95e8b59367
Merge pull request #7341 from filecoin-project/mg/feat/panic-reporter
...
feat: Catch panic to generate report and reraise
2021-10-01 10:50:19 +01:00
Mike Greenberg
2f8a2fc907
fix: Update lotusminer default paths
...
Co-authored-by: Łukasz Magiera <magik6k@users.noreply.github.com>
2021-09-30 09:06:55 -04:00
Aayush Rajasekaran
2bafdf7271
Modify tooling to support new WithdrawBalance return
2021-09-29 13:01:04 -04:00
Aayush Rajasekaran
c3c46e9097
Fix Drand fetching around null tipsets
2021-09-29 11:57:59 -04:00
Łukasz Magiera
83c8e981b4
lotus shed: fr32 utils
2021-09-27 18:31:36 +02:00