Replace sse2neon with native NEON #755

Merged Feb 9, 2024 (1 commit)

argilo (Member) commented Feb 1, 2024

volk_8u_x4_conv_k7_r2_8u_neonspiral is the only protokernel that needs sse2neon, which has proven to be a troublesome dependency. Here I've rewritten the protokernel in native NEON, eliminating the need for sse2neon.

Now that the volk_8u_x4_conv_k7_r2_8u kernel is adequately tested, we can be confident that the updated protokernel is correct.
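To illustrate what replacing sse2neon with native NEON means in practice, here is a minimal sketch. It is not taken from the VOLK protokernel: the helper name `sat_add_min_u8` and its arguments are hypothetical, and the choice of operation (a saturating add followed by an unsigned minimum, the kind of step a Viterbi add-compare-select uses) is an assumption made for illustration. With sse2neon this would be written with SSE names such as `_mm_adds_epu8` and `_mm_min_epu8` and translated by the shim; natively, the NEON intrinsics are called directly.

```c
#include <stdint.h>

#if defined(__ARM_NEON) || defined(__ARM_NEON__)
#include <arm_neon.h>
#endif

/* Hypothetical helper, not part of VOLK: for 16 unsigned bytes, compute
 * out[i] = min(saturating_add(metric_a[i], branch[i]), metric_b[i]). */
static void sat_add_min_u8(uint8_t out[16],
                           const uint8_t metric_a[16],
                           const uint8_t metric_b[16],
                           const uint8_t branch[16])
{
#if defined(__ARM_NEON) || defined(__ARM_NEON__)
    uint8x16_t a = vld1q_u8(metric_a);
    uint8x16_t b = vld1q_u8(metric_b);
    uint8x16_t m = vld1q_u8(branch);
    /* vqaddq_u8 is the native counterpart of _mm_adds_epu8. */
    uint8x16_t candidate = vqaddq_u8(a, m);
    /* vminq_u8 is the native counterpart of _mm_min_epu8. */
    vst1q_u8(out, vminq_u8(candidate, b));
#else
    /* Scalar fallback so the sketch also compiles on non-ARM hosts. */
    for (int i = 0; i < 16; i++) {
        unsigned sum = (unsigned)metric_a[i] + (unsigned)branch[i];
        uint8_t candidate = sum > 255u ? (uint8_t)255 : (uint8_t)sum;
        out[i] = candidate < metric_b[i] ? candidate : metric_b[i];
    }
#endif
}
```

Both paths produce the same 16 output bytes, so the scalar branch can serve as a reference when checking the NEON branch on an ARM target.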

Performance is essentially unchanged (the neonspiral timings differ by less than 1%):

Old:

neonspiral completed in 4557.97 ms
generic completed in 29817.1 ms

New:

neonspiral completed in 4591.51 ms
generic completed in 29832.3 ms
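The timings above can be compared with a few lines of arithmetic. The sketch below uses hypothetical helper names (`relative_change`, `speedup`) and only the four numbers reported in this comment. It shows the neonspiral time changing by roughly 0.7% and the protokernel remaining about 6.5 times faster than generic both before and after the rewrite.

```c
#include <stdio.h>

/* Hypothetical helpers for comparing the reported timings (milliseconds). */

/* Fractional change from old_ms to new_ms; positive means slower. */
static double relative_change(double old_ms, double new_ms)
{
    return (new_ms - old_ms) / old_ms;
}

/* How many times faster the optimized kernel is than the generic one. */
static double speedup(double generic_ms, double optimized_ms)
{
    return generic_ms / optimized_ms;
}

/* Print a summary of the numbers quoted in the PR description. */
static void print_summary(void)
{
    const double old_neon = 4557.97, old_generic = 29817.1;
    const double new_neon = 4591.51, new_generic = 29832.3;

    printf("neonspiral change: %.2f%%\n",
           100.0 * relative_change(old_neon, new_neon));
    printf("generic change:    %.2f%%\n",
           100.0 * relative_change(old_generic, new_generic));
    printf("speedup old: %.2fx, new: %.2fx\n",
           speedup(old_generic, old_neon),
           speedup(new_generic, new_neon));
}
```

The generic kernel was not touched by this PR, so the small shift in its timing gives a rough sense of the run-to-run variation to expect in these measurements.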

jdemel (Contributor) commented Feb 9, 2024

Merging this PR and waiting for #756 to be rebased. I appreciate that this change seems to simplify the code.

@jdemel jdemel merged commit 116fd8a into gnuradio:main Feb 9, 2024
33 checks passed
@argilo argilo deleted the remove-sse2neon-2 branch February 9, 2024 21:37
Alesha72003 pushed a commit to Alesha72003/volk that referenced this pull request May 15, 2024