[stdlib] Add method `atof()` to `String` #2649

fknfilewalker · 2024-05-14T11:16:06Z

This PR adds a function that can convert a String to a Float64. Right now it is implemented just for Float64 but maybe we should add other precisions?

This supports the following notations:

"-1236.233"
"2.25"
"2."
"1.7E+3"
# as well as the f/F postfix notation
"-1236.233f"
"2.25F"
"2.f"
"1.7E+3F"

Moosems · 2024-05-14T11:42:58Z

I think this also needs tests

fknfilewalker · 2024-05-14T18:40:19Z

What do I have to do to make the 'Standard Library tests and examples' test pass?

artemiogr97 · 2024-05-14T21:50:35Z

What do I have to do to make the 'Standard Library tests and examples' test pass?

@fknfilewalker seems like some files are not well formatted, try running mojo format . and commit your changes

artemiogr97 · 2024-05-15T05:13:26Z

@fknfilewalker there is still the file 'test_string' that needs to be formated, that's why the test are failing

Signed-off-by: Lukas Lipp <llipp@cg.tuwien.ac.at>

Moosems · 2024-05-15T11:48:20Z

@JoeLoser Would you be willing to review this or get someone from the proper team? Hope you're having a good day!

Signed-off-by: Lukas Lipp <llipp@cg.tuwien.ac.at>

laszlokindrat

Thank you for the patch, this is gonna be awesome! I have a few suggestions and quibbles, but it's generally looking nice!

stdlib/src/builtin/string.mojo

laszlokindrat · 2024-05-16T16:31:56Z

stdlib/src/builtin/string.mojo

+    var shift: Int = 10 ** abs(exponent)
+    if exponent > 0:
+        result *= shift
+    if exponent < 0:
+        result /= shift


Since result is a Float64 this will implicitly convert shift anyway. Does this work?

Suggested change

var shift: Int = 10 ** abs(exponent)

if exponent > 0:

result *= shift

if exponent < 0:

result /= shift

var result *= 10.0 ** exponent

the problem here is that mojo rounds the floats differently, and doing a division for negative exponents is closer to how mojo does it.
e.g., -0.3
mojo stores it as -0.29999999999999999
while the multiply method (3.0 * 0.1) returns -0.30000000000000004
the division method (3.0 / 10.0) produces -0.29999999999999999

so the result is different depending on whether doing a * 10 or / 0.1 or vice versa

A similar strange situation is the following

print(233.0 / 1000.0) #prints 0.23299999999999998 var x: Float64 = 233.0 var div: Float64 = 1000.0 print(x/div) #prints 0.23300000000000001

If this is a compiler issue and we can fix that then this suggestion would be definitely the better option

Okay, this makes sense actually. Can you please ensure we have at least one test case that hits this? Please leave a comment both here and at that test case to explain this. Thank you!

stdlib/src/builtin/string.mojo

stdlib/test/builtin/test_string.mojo

Co-authored-by: Laszlo Kindrat <laszlokindrat@gmail.com> Signed-off-by: Lukas Lipp <15105596+fknfilewalker@users.noreply.github.com>

Signed-off-by: Lukas Lipp <llipp@cg.tuwien.ac.at>

laszlokindrat · 2024-05-16T17:41:37Z

Also, please add a changelog entry!

Signed-off-by: Lukas Lipp <llipp@cg.tuwien.ac.at>

Signed-off-by: Lukas Lipp <15105596+fknfilewalker@users.noreply.github.com>

Signed-off-by: Lukas Lipp <llipp@cg.tuwien.ac.at>

[External] [stdlib] Add method `strip()` to `StringRef` This PR adds a `strip` method to `StringRef`. This PR is helpful for #2649. Where can I find the test cases for StringRef? ORIGINAL_AUTHOR=Lukas Lipp <15105596+fknfilewalker@users.noreply.github.com> PUBLIC_PR_LINK=#2683 Co-authored-by: Lukas Lipp <15105596+fknfilewalker@users.noreply.github.com> Closes #2683 MODULAR_ORIG_COMMIT_REV_ID: 5ac8b1f3b45c75a964c5d9368e3871e7fc617a88

Signed-off-by: Lukas Lipp <15105596+fknfilewalker@users.noreply.github.com>

Signed-off-by: Lukas Lipp <llipp@cg.tuwien.ac.at>

laszlokindrat

Great patch, thank you so much!

laszlokindrat · 2024-05-17T20:13:42Z

@JoeLoser Would it make sense to make this functionality parametric on a floating point type? I.e. something like

fn atof[type: Dtype = Dtype.float64](str: String) raises -> Scalar[dtype]:
    constrained[type.is_float(), "must be float"]()
    ...

laszlokindrat · 2024-05-17T20:15:19Z

@fknfilewalker If you are interested, you could also do an overload for StringLiterals that returns arbitrary precision floats:

fn atof(str_literal: StringLiteral) -> FloatLiteral: ...

JoeLoser · 2024-05-17T20:18:14Z

@JoeLoser Would it make sense to make this functionality parametric on a floating point type? I.e. something like
fn atof[type: Dtype = Dtype.float64](str: String) raises -> Scalar[dtype]:
    constrained[type.is_float(), "must be float"]()
    ...

Is your goal to make it work for arbitrary-precision floats, or something else?

laszlokindrat · 2024-05-17T20:21:21Z

@JoeLoser Would it make sense to make this functionality parametric on a floating point type? I.e. something like
fn atof[type: Dtype = Dtype.float64](str: String) raises -> Scalar[dtype]:
    constrained[type.is_float(), "must be float"]()
    ...
Is your goal to make it work for arbitrary-precision floats, or something else?

Making it work for FloatLiteral would be nice, but there might be use cases where it's important not to have to go through Float64 if wanting to go to Float32 or Float16 (because it might actually give different results due to rounding).

modularbot · 2024-05-17T21:28:29Z

✅🟣 This contribution has been merged 🟣✅

Your pull request has been merged to the internal upstream Mojo sources. It will be reflected here in the Mojo repository on the nightly branch during the next Mojo nightly release, typically within the next 24-48 hours.

We use Copybara to merge external contributions, click here to learn more.

modularbot · 2024-05-18T05:48:24Z

Landed in 8641d49! Thank you for your contribution 🎉

[External] [stdlib] Add method `atof()` to `String` This PR adds a function that can convert a `String` to a `Float64`. Right now it is implemented just for Float64 but maybe we should add other precisions? This supports the following notations: ```python "-1236.233" "2.25" "2." "1.7E+3" # as well as the f/F postfix notation "-1236.233f" "2.25F" "2.f" "1.7E+3F" ``` ORIGINAL_AUTHOR=Lukas Lipp <15105596+fknfilewalker@users.noreply.github.com> PUBLIC_PR_LINK=#2649 Co-authored-by: Lukas Lipp <15105596+fknfilewalker@users.noreply.github.com> Closes #2649 MODULAR_ORIG_COMMIT_REV_ID: b8d2c4ef38faa639e749957a0c1ba1a9c02a28cf

fknfilewalker · 2024-05-18T10:44:51Z

@fknfilewalker If you are interested, you could also do an overload for StringLiterals that returns arbitrary precision floats:
fn atof(str_literal: StringLiteral) -> FloatLiteral: ...

Sure, is there a reason why we have the inner _atof function (same for _atol)?

fknfilewalker · 2024-05-18T11:00:35Z

@JoeLoser Would it make sense to make this functionality parametric on a floating point type? I.e. something like
fn atof[type: Dtype = Dtype.float64](str: String) raises -> Scalar[dtype]:
    constrained[type.is_float(), "must be float"]()
    ...
Is your goal to make it work for arbitrary-precision floats, or something else?
Making it work for FloatLiteral would be nice, but there might be use cases where it's important not to have to go through Float64 if wanting to go to Float32 or Float16 (because it might actually give different results due to rounding).

In order for this to work with FloatLiteral we would need to use StringLiteral I guess? How does it work to create a function for runtime values and compile time values?

fknfilewalker · 2024-05-18T19:17:51Z

Could atof() accept a Stringable instead of a String? Then everything could be used?

[External] [stdlib] Add method `strip()` to `StringRef` This PR adds a `strip` method to `StringRef`. This PR is helpful for modularml#2649. Where can I find the test cases for StringRef? ORIGINAL_AUTHOR=Lukas Lipp <15105596+fknfilewalker@users.noreply.github.com> PUBLIC_PR_LINK=modularml#2683 Co-authored-by: Lukas Lipp <15105596+fknfilewalker@users.noreply.github.com> Closes modularml#2683 MODULAR_ORIG_COMMIT_REV_ID: 5ac8b1f3b45c75a964c5d9368e3871e7fc617a88

[External] [stdlib] Add method `atof()` to `String` This PR adds a function that can convert a `String` to a `Float64`. Right now it is implemented just for Float64 but maybe we should add other precisions? This supports the following notations: ```python "-1236.233" "2.25" "2." "1.7E+3" # as well as the f/F postfix notation "-1236.233f" "2.25F" "2.f" "1.7E+3F" ``` ORIGINAL_AUTHOR=Lukas Lipp <15105596+fknfilewalker@users.noreply.github.com> PUBLIC_PR_LINK=modularml#2649 Co-authored-by: Lukas Lipp <15105596+fknfilewalker@users.noreply.github.com> Closes modularml#2649 MODULAR_ORIG_COMMIT_REV_ID: b8d2c4ef38faa639e749957a0c1ba1a9c02a28cf

laszlokindrat · 2024-05-20T14:23:03Z

Sure, is there a reason why we have the inner _atof function (same for _atol)?

I don't think so, maybe the implementation used to be different and this was left over.

In order for this to work with FloatLiteral we would need to use StringLiteral I guess?

Yes.

Could atof() accept a Stringable instead of a String?

Yes, but notice that the overload I suggested returns a FloatLiteral, not a Scalar[...]:

fn atof(str_literal: StringLiteral) -> FloatLiteral: ...

FloatLiteral is arbitrary precision, and they idea here would be that this function can only be invoked at compile time. For this to work, we need to make StringLiteral @nonmaterializable, which is something I'm working on.

[External] [stdlib] Add method `strip()` to `StringRef` This PR adds a `strip` method to `StringRef`. This PR is helpful for modularml#2649. Where can I find the test cases for StringRef? ORIGINAL_AUTHOR=Lukas Lipp <15105596+fknfilewalker@users.noreply.github.com> PUBLIC_PR_LINK=modularml#2683 Co-authored-by: Lukas Lipp <15105596+fknfilewalker@users.noreply.github.com> Closes modularml#2683 MODULAR_ORIG_COMMIT_REV_ID: 5ac8b1f3b45c75a964c5d9368e3871e7fc617a88

[External] [stdlib] Add method `atof()` to `String` This PR adds a function that can convert a `String` to a `Float64`. Right now it is implemented just for Float64 but maybe we should add other precisions? This supports the following notations: ```python "-1236.233" "2.25" "2." "1.7E+3" "-1236.233f" "2.25F" "2.f" "1.7E+3F" ``` ORIGINAL_AUTHOR=Lukas Lipp <15105596+fknfilewalker@users.noreply.github.com> PUBLIC_PR_LINK=modularml#2649 Co-authored-by: Lukas Lipp <15105596+fknfilewalker@users.noreply.github.com> Closes modularml#2649 MODULAR_ORIG_COMMIT_REV_ID: b8d2c4ef38faa639e749957a0c1ba1a9c02a28cf

fknfilewalker requested a review from a team as a code owner May 14, 2024 11:16

fknfilewalker changed the title ~~An atof implementation (String to Float)~~ [stdlib] Add method atof() to String May 14, 2024

fknfilewalker changed the title ~~[stdlib] Add method atof() to String~~ [stdlib] Add method atof() to String May 14, 2024

Add atof

8673189

Signed-off-by: Lukas Lipp <llipp@cg.tuwien.ac.at>

fknfilewalker force-pushed the atof branch from 65c3c2e to 8673189 Compare May 15, 2024 09:04

Merge branch 'nightly' into atof

1e8ae3c

JoeLoser assigned laszlokindrat May 15, 2024

fknfilewalker and others added 6 commits May 15, 2024 17:40

Better version of atof + more tests

fc95098

Signed-off-by: Lukas Lipp <llipp@cg.tuwien.ac.at>

Make sure there is a number after 12.3E

b58b737

Signed-off-by: Lukas Lipp <llipp@cg.tuwien.ac.at>

More tests

915e662

Signed-off-by: Lukas Lipp <llipp@cg.tuwien.ac.at>

Merge branch 'nightly' into atof

059e050

Merge branch 'nightly' into atof

e6f2b45

Cleanup and test for -- cases

c5d6395

Signed-off-by: Lukas Lipp <llipp@cg.tuwien.ac.at>

laszlokindrat reviewed May 16, 2024

View reviewed changes

fknfilewalker and others added 4 commits May 16, 2024 19:11

Merge branch 'nightly' into atof

f073d0e

Simplify through inversion

cec6de4

Co-authored-by: Laszlo Kindrat <laszlokindrat@gmail.com> Signed-off-by: Lukas Lipp <15105596+fknfilewalker@users.noreply.github.com>

_atof_error return error directly

e44fb0b

Co-authored-by: Laszlo Kindrat <laszlokindrat@gmail.com> Signed-off-by: Lukas Lipp <15105596+fknfilewalker@users.noreply.github.com>

Additional changes for error fix

7d89c44

Signed-off-by: Lukas Lipp <llipp@cg.tuwien.ac.at>

fknfilewalker mentioned this pull request May 16, 2024

[stdlib] Add method strip() to StringRef #2683

Closed

update changelog

f0d9f15

Signed-off-by: Lukas Lipp <llipp@cg.tuwien.ac.at>

fknfilewalker requested a review from a team as a code owner May 16, 2024 19:38

fknfilewalker and others added 3 commits May 16, 2024 23:53

refactor many casts with direct buffer access

fcd97db

Signed-off-by: Lukas Lipp <llipp@cg.tuwien.ac.at>

Merge branch 'nightly' into atof

a7f2c8b

Signed-off-by: Lukas Lipp <15105596+fknfilewalker@users.noreply.github.com>

check for dual ++ and multiply sign without branching

00d1cd0

Signed-off-by: Lukas Lipp <llipp@cg.tuwien.ac.at>

check for inf and nan

f173c0f

Signed-off-by: Lukas Lipp <llipp@cg.tuwien.ac.at>

fknfilewalker and others added 3 commits May 17, 2024 10:12

Merge branch 'nightly' into atof

533d21c

Signed-off-by: Lukas Lipp <15105596+fknfilewalker@users.noreply.github.com>

use stringref strip now

1f41593

Signed-off-by: Lukas Lipp <llipp@cg.tuwien.ac.at>

retrigger checks

acb0d31

Signed-off-by: Lukas Lipp <llipp@cg.tuwien.ac.at>

laszlokindrat added the imported-internally Signals that a given pull request has been imported internally. label May 17, 2024

laszlokindrat approved these changes May 17, 2024

View reviewed changes

modularbot added merged-internally Indicates that this pull request has been merged internally merged-externally Merged externally in public mojo repo labels May 17, 2024

modularbot closed this May 18, 2024

fknfilewalker deleted the atof branch May 18, 2024 08:33

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[stdlib] Add method `atof()` to `String` #2649

[stdlib] Add method `atof()` to `String` #2649

fknfilewalker commented May 14, 2024 •

edited

Moosems commented May 14, 2024

fknfilewalker commented May 14, 2024 •

edited

artemiogr97 commented May 14, 2024

artemiogr97 commented May 15, 2024 •

edited

Moosems commented May 15, 2024

laszlokindrat left a comment

laszlokindrat May 16, 2024

fknfilewalker May 16, 2024 •

edited

laszlokindrat May 16, 2024

laszlokindrat commented May 16, 2024

laszlokindrat left a comment

laszlokindrat commented May 17, 2024

laszlokindrat commented May 17, 2024

JoeLoser commented May 17, 2024

laszlokindrat commented May 17, 2024

modularbot commented May 17, 2024

modularbot commented May 18, 2024

fknfilewalker commented May 18, 2024 •

edited

fknfilewalker commented May 18, 2024

fknfilewalker commented May 18, 2024

laszlokindrat commented May 20, 2024

[stdlib] Add method atof() to String #2649

[stdlib] Add method atof() to String #2649

Conversation

fknfilewalker commented May 14, 2024 • edited

Moosems commented May 14, 2024

fknfilewalker commented May 14, 2024 • edited

artemiogr97 commented May 14, 2024

artemiogr97 commented May 15, 2024 • edited

Moosems commented May 15, 2024

laszlokindrat left a comment

Choose a reason for hiding this comment

laszlokindrat May 16, 2024

Choose a reason for hiding this comment

fknfilewalker May 16, 2024 • edited

Choose a reason for hiding this comment

laszlokindrat May 16, 2024

Choose a reason for hiding this comment

laszlokindrat commented May 16, 2024

laszlokindrat left a comment

Choose a reason for hiding this comment

laszlokindrat commented May 17, 2024

laszlokindrat commented May 17, 2024

JoeLoser commented May 17, 2024

laszlokindrat commented May 17, 2024

modularbot commented May 17, 2024

modularbot commented May 18, 2024

fknfilewalker commented May 18, 2024 • edited

fknfilewalker commented May 18, 2024

fknfilewalker commented May 18, 2024

laszlokindrat commented May 20, 2024

[stdlib] Add method `atof()` to `String` #2649

[stdlib] Add method `atof()` to `String` #2649

fknfilewalker commented May 14, 2024 •

edited

fknfilewalker commented May 14, 2024 •

edited

artemiogr97 commented May 15, 2024 •

edited

fknfilewalker May 16, 2024 •

edited

fknfilewalker commented May 18, 2024 •

edited