Group Abstract

Message Boards

WOLFRAM COMMUNITY

12K Views

3 Replies

13 Total Likes

View groups...

Follow this post

Share this post:

GROUPS:

Staff Picks Data Science Physics Curated Data Graphics and Visualization Import and Export Wolfram Language Units

The 2016 Atomic Mass Evaluation (AME2016) data

Estevao Teixeira

Posted 6 years ago

PART 1 - Parsing the data I was working with the AME2016 data and I decided to curate the data in order to insert it in the Wolfram Data Repository. Introduction The brief description of the data is: As a fundamental property of nuclei, atomic masses are widely used in many domains of science and engineering. A reliable atomic mass table derived from the experimental data, where the atomic masses and the relevant experimental information can be found conveniently, is in high demand by the research community. To meet the demands, the Atomic Mass Evaluation (Ame) was created in 1950's and now serves the research community by providing the most reliable and comprehensive information related to the atomic masses. (G. Audi, Int. J. Mass Spectr.251: 85 (2006) http://dx.doi.org/10.1016/j.ijms.2006.01.04) This data represents the newest published one in 2016. It is called AME2016. The data represents: Atomic mass list for analysis. This list contains the elements, mass excess, binding energy, beta-decay energy, atomic mass and more. The data itself can be found at Atomic Mass Data Center (AMDC) website: http://amdc.impcas.ac.cn/index.html \| http://amdc.impcas.ac.cn/web/masseval.html The data looks like this a file with 3426 lines of Atomic mass data. The txt file has 39 lines as header and every line of data has 124 characters. All this information is found in the data file. Importing the data file First of all, I import the file without the headers SetDirectory@NotebookDirectory[]; rawData = Import["mass16.dat", "Lines"][[40 ;;]]; Parsing the data file to a dataset Using the format defined by the article authors, we now how to work in each line of the file. Let's parse the file to a dataset. parsedData = Map[ <\| "CC" -> StringTake[#, {1, 1}], "NZ" -> StringTake[#, {2, 4}], "N" -> StringTake[#, {5, 9}], "Z" -> StringTake[#, {10, 14}], "A" -> StringTake[#, {15, 19}], "El" -> StringTake[#, {21, 23}], "o" -> StringTake[#, {24, 27}], "mass" -> StringTake[#, {29, 41}], "masserror" -> StringTake[#, {42, 52}], "binding" -> StringTake[#, {53, 63}], "bindingerror" -> StringTake[#, {64, 72}], "B" -> StringTake[#, {74, 75}], "beta" -> StringTake[#, {76, 87}], "betaerror" -> StringTake[#, {88, 96}], "atomicmass" -> StringTake[#, {97, 100}], "atomicmassu" -> StringTake[#, {101, 113}], "atomicmasserror" -> StringTake[#, {114, UpTo[124]}] \|> & , rawData] // Dataset; Data cleaning Now, we have to clean it and make it searchable. Cleaning whitespaces We remove all whitespaces in every fields. cleanData = Map[Map[If[StringLength[x = StringReplace[#, Whitespace -> ""]] < 1, Missing[], x] &, #] &, parsedData]; Helper functions These functions will help us in the cleaning process. For example, in the data, the authors defined the character "" to represent a "non-experimental" value. We will find these characters and fix other values. The "clearNonExperimentalSign"* removes "#" or "" from a string clearNonExperimentalSign[s_String] := StringReplace[s,("#"\|"")-> ""]; The "clearNotMissing" parses all the info that are not missing clearNotMissing[v_,unit_String,fixvalue_] := If[MissingQ@v,v,valueToQuantity[clearNonExperimentalSign@v,unit,fixvalue]]; The "matchNonExperimentalSign" finds all "#" characters and return if it is a "non-experimental" or "experimental" value. matchNonExperimentalSign[s_String] := If[StringMatchQ[s,___~~"#"~~___],"non-experimental","experimental"]; And finally, the "valueToQuantity" transform any possible value to a WL Quantity. valueToQuantity[s_,unit_String,fixvalue_] := If[NumberQ[ToExpression@s ],N@Quantity[ToExpression@s* fixvalue,unit],s] Final and parsed dataset Now, we parse all the previous data to a final dataset with all the information parsed. finalData = Map[ <\| "Mass List" -> <\| "Element" -> Quiet[IsotopeData[{ToExpression@#[["Z"]], ToExpression@#[["A"]]}]] /. _IsotopeData -> #[["El"]] <> #[["A"]], "Origin" -> #[["o"]], "Mass" -> <\| "value" -> valueToQuantity[clearNonExperimentalSign@#[["mass"]], "MeV", 10^-3], "error" -> valueToQuantity[clearNonExperimentalSign@#[["masserror"]], "MeV", 10^-3], "type" -> matchNonExperimentalSign@#[["mass"]] \|>, "Binding" -> <\| "value" -> valueToQuantity[clearNonExperimentalSign@#[["binding"]], "MeV", 10^-3], "error" -> valueToQuantity[clearNonExperimentalSign@#[["bindingerror"]], "MeV", 10^-3], "type" -> matchNonExperimentalSign@#[["binding"]] \|>, "BetaDecay" -> <\| "type" -> #[["B"]], "value" -> clearNotMissing[#[["beta"]], "MeV", 10^-3], "error" -> clearNotMissing[#[["betaerror"]], "MeV", 10^-3] \|>, "AtomicMass" -> <\| "A" -> #[["atomicmass"]], "N" -> #[["N"]], "Z" -> #[["Z"]], "value" -> valueToQuantity[clearNonExperimentalSign@#[["atomicmassu"]], "u", 10^-6], "error" -> valueToQuantity[clearNonExperimentalSign@#[["atomicmasserror"]], "u", 10^-6], "type" -> matchNonExperimentalSign@#[["atomicmassu"]] \|> \|> \|> &, cleanData ]; finalData I submitted this data for the Wolfram Data Repository. When they give me a link I will update this post with it. The next part of this post is to compare this dataset with the data in Wolfram Language. PART 2 - Comparing with WL In the last text we got the parsed dataset. Here, I reduce it to work just with some fields elementMasses = finalData[All, 1, {"Element", "Mass", "AtomicMass"}] Ideas Wolfram Language (WL) has a function called IsotopeData. One can retrieve a lot of information about isotopes. One of the information that we will try to compare is the mass excess. Let`s see the difference between the last published data and the data in WL. Comparing the MassExcess value column with the data in Wolfram Language. To plot, we need still less information. Because of that, we create the following variable ReducedData = finalData[All, 1, {"Element", "MassExcess", "AtomicMass"}]; ExampleListToPlot = {#Element, ToExpression@#AtomicMass[["N"]], #MassExcess[["Value"]]} & /@ ReducedData[[2 ;;]]; Then, we create a new list with those elements that are not in WL MissingElmentsInWL = Query[Select[StringQ[#["Element"]] &]] @ ReducedData[2 ;;]; ListMissingElementsWL = {#Element, ToExpression@#AtomicMass[["N"]], #MassExcess[["Value"]]} & /@ MissingElmentsInWL; Now, we plot all the Mass Excess from All elements. The blue dots are the elements in this data resource, the red ones are the ones missing in WL. ListPlot[{ExampleListToPlot[[All, {2, 3}]], ListMissingElementsWL[[All, {2, 3}]]}, PlotRange -> All, PlotStyle -> {{}, {Red}}, AxesLabel -> {"N (Neutrons)", "Mass excess (MeV)"}, PlotLegends -> {"Isotopes in both Data Resource and WL", "Missing Isotopes in WL"}, ImageSize -> 600] The first conclusion: There is 253 isotopes missing in Wolfram language. Difference between the mass excess in both data Now, I wanted to check the difference between the data in each resource. I will create two list, one with the isotopes in the AME2016 data and the other with the isotopes that are in WL articleCleanDataset = Query[Select[Not@StringQ[#["Element"]] &]] @ ReducedData[2 ;;]; mathematicaData = <\| "Element" -> #Element, "N" -> ToExpression@#AtomicMass[["N"]], "WLMassExcess" -> #Element["MassExcess"], "MassExcess" -> #Mass[["value"]], "MassDiff" -> #Mass[["value"]] - #Element["MassExcess"] \|> & /@ articleCleanDataset Lets get the cases where we have a huge difference between our data and WL data. Lets show the ones that are higher than 2 MeV Query[Select[Abs@#MassDiff > Quantity[2, "MeV"] & ]] @ mathematicaData Then, we replace all missing isotopes in WL as if they have zero mass listMissingElementsToPlot = {#[[2]], Quantity[0, "MeV"]} & /@ ListMissingElementsWL; and we plot it plotDifferenceBetweenMasses = ListPlot[{mathematicaData[[All, {2, 5}]], listMissingElementsToPlot}, PlotRange -> All, PlotStyle -> {{}, {Red, PointSize[0.004]}}, AxesLabel -> {"N (Neutron(s))", "(MeV)"}, PlotLabel -> "Mass excess difference between Ref1 and Wolfram Language values \ in MeV", PlotLegends -> {"Ref#1 - WL", "Missing values in WL"}, ImageSize -> 1000 ] Conclusions I did this "mini-study" to compare the newest data in the area with the values provided by the Wolfram Language. One could see that WL needs to update their data. It is useful to play with it but it is not the good one to work with. I searched in their documentation and they say that the most recent data is from 2007. The data that I worked in this post is from 2016, the most recent one. The next data is expected to be released in 2022. In conclusion, It would be great if WL has the most recent values to work with. I hope that you had enjoyed the post. Thanks.

POSTED BY: Estevao Teixeira

3 Replies

Sort By:

Estevao Teixeira

Posted 6 years ago

Hi, Alexander, A good thing that you can do is to parse the AME2016 file and use their values, as I did first. $$Data was a mistake. I changed to finalData. I submitted this dataset to the Data Repository but it is not available yet. You can check the notebook at https://github.com/sbrno/AME2016_WolframLanguage.

POSTED BY: Estevao Teixeira

Alexander Lozovsky

Posted 6 years ago

Estevao, thank you for your "mini-study"! And I agree with your conclusion. Unfortunately, Wolfram's data may be inconsistent even within itself different data functions may provide different values for the same entity. E.g., when using functions like `Quantity`, `Entity` (with `"Particle"`/`"PhysicalConstant"`) or `ParticleData` you get inconsistent values for the proton's mass or elementary charge, which seems to affect the results of `UnitConvert` function. I came upon it when trying to calculate the mass/energy change in a hydrogen atom formation reaction. So, currently Wolfram's data cannot be relied upon. P.S. What is the `$$Data` variable supposed to denote in your code? Can you provide a downloadable version of your study (as an .nb fiile)?

POSTED BY: Alexander Lozovsky

EDITORIAL BOARD

EDITORIAL BOARD, WOLFRAM

Posted 6 years ago

- Congratulations! This post is now featured in our Staff Pick column as distinguished by a badge on your profile of a Featured Contributor! Thank you, keep it coming, and consider contributing your work to the The Notebook Archive! Please note, we consolidated your two separate posts in one as they are closely related.

POSTED BY: EDITORIAL BOARD

Reply to this discussion

Reply Preview

Attachments

Remove Add a file to this post

Follow this discussion

or Discard

Feedback

The 2016 Atomic Mass Evaluation (AME2016) data

PART 1 - Parsing the data

Introduction

Importing the data file

Parsing the data file to a dataset

Data cleaning

Cleaning whitespaces

Helper functions

Final and parsed dataset

PART 2 - Comparing with WL

Ideas

Comparing the MassExcess value column with the data in Wolfram Language.

Difference between the mass excess in both data

Conclusions