FS#64574 - [tensorflow] OneDeviceStrategy/MirrowedStrategy broken after upgrade to 2.0.0.5

Attached to Project: Community Packages
Opened by Oliver Kowalke (olk) - Tuesday, 19 November 2019, 20:26 GMT
Last edited by Sven-Hendrik Haase (Svenstaro) - Thursday, 12 December 2019, 05:31 GMT
Task Type Bug Report
Category Packages
Status Closed
Assigned To Sven-Hendrik Haase (Svenstaro)
Konstantin Gizdov (kgizdov)
Architecture All
Severity Low
Priority Normal
Reported Version
Due in Version Undecided
Due Date Undecided
Percent Complete 100%
Votes 0
Private No

Details

Description: Using OneDeviceStrategy(device='/gpu:0') results in error "RuntimeError: /job:localhost/replica:0/task:0/device:GPU:0 unknown device." after upgrading to version 2.0.0.5.
MirroredStrategy() uses CPU instread of GPU. Maybe an upstream bug.


Additional info:
* 2.0.0.5

Steps to reproduce: run example that uses a strategy:

strategy = tf.distribute.OneDeviceStrategy(device="/gpu:0")
#strategy = tf.distribute.MirroredStrategy()
with strategy.scope():
network_layers = small.layers(input_shape)
This task depends upon

Closed by  Sven-Hendrik Haase (Svenstaro)
Thursday, 12 December 2019, 05:31 GMT
Reason for closing:  Fixed
Comment by Oliver Kowalke (olk) - Tuesday, 19 November 2019, 20:41 GMT
same code works with version 2.0.0-2
Comment by Sven-Hendrik Haase (Svenstaro) - Wednesday, 20 November 2019, 18:05 GMT
Can you check with the archive which version makes it break exactly? That would be very helpful indeed.
Comment by Oliver Kowalke (olk) - Wednesday, 20 November 2019, 18:54 GMT
2.0.0.2 works
2.0.0.4 was broken because of Python-3.8 update (see  FS#64528 )
So 2.0.0.3 needs to be tested - could you tell me at which date 2.0.0.4 was updated?
Because I do a complete system downgrade by pinning pacman to a specific date - I'd like to choose a date for downgrading one day before 2.0.0.4.
Comment by Oliver Kowalke (olk) - Saturday, 23 November 2019, 17:39 GMT
2.0.0.6 fails too
Comment by Sven-Hendrik Haase (Svenstaro) - Tuesday, 03 December 2019, 14:59 GMT
Can you test against 2.1.0rc0?
Comment by Oliver Kowalke (olk) - Tuesday, 03 December 2019, 18:14 GMT
2.1.0rc0 seams to fix the issue

Could you concider to add a Arch package for keras-tuner and tensorflow_dataset?
Comment by Sven-Hendrik Haase (Svenstaro) - Thursday, 12 December 2019, 05:31 GMT
Please open new bug reports for your requests. It's hard enough to track this as it stands. :P

Loading...