Package Details: python-apex-git 0.1.r599-1

Git Clone URL: (read-only, click to copy)
Package Base: python-apex-git
Description: A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch
Upstream URL:
Keywords: pytorch
Licenses: BSD
Conflicts: python-apex
Provides: python-apex
Submitter: leomao
Maintainer: leomao
Last Packager: leomao
Votes: 0
Popularity: 0.000000
First Submitted: 2018-12-14 06:07
Last Updated: 2019-11-27 05:42

Latest Comments

« First ‹ Previous 1 2

leomao commented on 2019-04-12 09:55

Please check Currently, I don't have a solution with pytorch/pytorch-cuda in the community repo...

For now, I compile pytorch master myself...

drr21 commented on 2019-04-09 16:05

I get this warning when I use apex.amp:

'Warning: multi_tensor_applier fused unscale kernel is unavailable, possibly because apex was installed without --cuda_ext --cpp_ext. Using Python fallback. Original ImportError was: ImportError('/usr/lib/python3.7/site-packages/ undefined symbol: _ZN3c105ErrorC1ENS_14SourceLocationERKSs')'

hottea commented on 2019-03-22 08:41

@leomao see syncbn for syncbn example. Actually, see this issue. It seems that pytorch appends -D_GLIBCXX_USE_CXX11_ABI=0 to compiler flags by default. I don't see a way to override it. And according to pytorch's PKGBUILD, there is no modify related to this flag. I believe that pytorch is build with -D_GLIBCXX_USE_CXX11_ABI=0, which is the default behavior of pytorch official configuration. So it would be OK to build apex extension with the same flag, aka -D_GLIBCXX_USE_CXX11_ABI=0. However, it's not. I try to build build apex with -D_GLIBCXX_USE_CXX11_ABI=1 by manually replace all -D_GLIBCXX_USE_CXX11_ABI=0 to -D_GLIBCXX_USE_CXX11_ABI=1 in /usr/lib/python3.7/site-packages/torch/utils/, and it works as expected. However, one should not expect to modify this during building apex with devtools, right?

leomao commented on 2019-02-25 03:10

Hi @hottea, thanks for reporting the issue. Could you provide a code snippet for testing? I just checked that the examples and tests ran without errors.

hottea commented on 2019-02-25 02:42

c++filt _ZN3c105ErrorC1ENS_14SourceLocationERKSs gives me:

c10::Error::Error(c10::SourceLocation, std::basic_string<char, std::char_traits<char>, std::allocator<char> > const&)

and then I found, it seems we need to add -D_GLIBCXX_USE_CXX11_ABI=0 when compiling apex.

hottea commented on 2019-02-25 02:04

I got this warning:

Warning:  using Python fallback for SyncBatchNorm, possibly because apex was ins
talled without --cuda_ext.  The exception raised when attempting to import the c
uda backend was:  /usr/lib/python3.7/site-packages/syncbn.cpython-37m-x86_64-lin undefined symbol: _ZN3c105ErrorC1ENS_14SourceLocationERKSs

It seems that it failed to build with --cuda_ext? Or maybe there is something wrong with libs?